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1 Introduction and main results 

Topological instability of action variables in multidimensional nearly integrable Hamiltonian systems is 
known as Arnold Diffusion. For autonomous Hamiltonian systems with two degrees of freedom KAM 
theory generically implies topological stability of the action variables, i.e. under the flow of the perturbed 
system the action variables stay close to their initial values for all times. On the contrary, for systems 
with more than two degrees of freedom, outside a large set of initial conditions provided by KAM theory, 
the action variables may undergo a drift of order one in a very long, but finite time called the "diffusion 
time" . Arnold first showed up this instability phenomenon for a peculiar Hamiltonian in the famous 
paper g. 

As suggested by normal form theory near simple resonances, the Hamiltonian models which are 
usually studied have the form H(I,<p,p,q) — {if/2) + lu ■ I 2 + (p 2 /2) + e(cos<7 — f) + ep,f(I,<p,p,q) 
where e and \i are small parameters, n := n\ + (I\,l2,p) G R™ x R ar e the action variables and 
(<p, q) — (<pi, tf2, g) £ T" x T are the angle variables. In Arnold's model I\,l2 € R, ui = 1, /(/, (p,p, q) = 
(cosq — I)(sinipi + cos (p%) and diffusion is proved for p exponentially small w.r.t. ^fe. Physically 
Hamiltonian H describes a system of n\ "rotators" and n 2 harmonic oscillators weakly coupled with a 
pendulum through a perturbation term. 

The mechanism proposed in || to prove the existence of Arnold diffusion and thereafter become 
classical, is the following one. For p = 0, the Hamiltonian system associated to H admits a continuous 
family of n-dimensional partially hyperbolic invariant tori 7} = {tp G T ra , (7i,/2) — I, q = P = 0} 
possessing stable and unstable manifolds Wq (7}) = Wq(Ti) — {ip G T™, (h,h) = I, (p 2 /2) + e(cosq — 
f) = 0}. The method used in g] to produce unstable orbits relies on the construction, for /x ^ 0, of 
"transition chains" of perturbed partially hyperbolic tori Tj 1 close to 7} connected one to another by 
heteroclinic orbits. Therefore in general the first step is to prove the persistence of such hyperbolic tori 
Tj 1 for /x 7^ small enough, and to show that the perturbed stable and unstable manifolds W^Tj 1 ) and 
W^iTj 1 ) split and intersect transversally ("splitting problem"). The second step is to find a transition 
chain of perturbed tori: this is a diffucult task since, for general non-isochronous systems, the surviving 
perturbed tori Tj 1 are separated by the gaps appearing in KAM constructions. Two perturbed invariant 
tori Tj and Tft could be too distant one from the other, forbidding the existence of a heteroclinic 
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intersection between W^Tf 1 ) and W^(Tjf): this is the so called "gap problem". In Q this difficulty is 
bypassed by the peculiar choice of the perturbation /(/, tp,p, q) — (cos q— l)f(tp), whose gradient vanishes 
on the unperturbed tori 7}, leaving them all invariant also for fi 0. The final step is to prove, by a 
"shadowing argument", the existence of a true diffusion orbit, close to a given transition chain of tori, 
for which the action variables / undergo a drift of O(l) in a certain time called the diffusion time. 

The first paper proving Arnold diffusion in presence of perturbations not preserving the unperturbed 
tori has been |Q. Extending Arnold's analysis, it is proved in Jl^| that, if the perturbation is a trigono- 
metric polynomial in the angles ip, then, in some regions of the phase space, the "density" of perturbed 
invariant tori is high enough to allow the construction of a transition chain. 

Regarding the shadowing problem, geometrical method, see e.g. |iq| , fill , flijl , and variational 

ones, see e.g. g], have been applied, in the last years, in order to prove the existence of diffusion orbits 
shadowing a given transition chain of tori and to estimate the diffusion time. We also quote the important 
papers which, even if dealing with Arnold's model perturbation only, have obtained, by variational 

methods, very good diffusion time estimates and have introduced new ideas for studying the shadowing 
problem. For isochronous systems new variational results concerning the shadowing and the splitting 
problem have been obtained in Q , [|| and |j| . 

In this paper we provide an alternative mechanism to produce diffusion orbits. This method is not 
based on the existence of a transition chain of tori: we avoid the KAM construction of the perturbed 
hyperbolic tori, proving directly the existence of a drifting orbit as a local minimum of an action functional. 
At the same time our variational approach achieves the optimal diffusion time. We also prove that our 
diffusion time estimate is the optimal one as a consequence of a general stability result, proved via classical 
perturbation theory. As in fl2| we deal with a perturbation which is a trigonometric polynomial in the 
angles and our diffusion orbits will not connect any two arbitrary frequencies of the action space, even if 
we manage to connect more frequencies than in |fl2f , proving the drift also in some regions of the phase 
space where transition chains might not exist. Clearly if the perturbation is chosen as in Arnold's example 
we can drift in all the phase space with no restriction. The results proved here have been announced in 
0- 

In this paper we will assume, as in Arnold's paper, the parameter /i to be small enough in order to 
validate the so called Poincare-Melnikov approximation, when the first order expansion term in fi for the 
splitting, the so called Poincare-Melnikov function, is the dominant one. For this reason, through this 
paper we will fix the "Lyapunov exponent" of the pendulum e := 1, considering the so called "a-priori 
unstable" case. Actually our variational shadowing technique is not restricted to the a-priori unstable 
case, but would allow, in the same spirit of Q and ||, once a "splitting condition" is someway proved, 
to get diffusion orbits with the best diffusion time (in terms of some measure of the splitting). 

We will consider nearly integrable non- isochronous Hamiltonian systems defined by 

I 2 v 2 

= — + !j + (cosq-l)+iif(I,(p,p,q,t), (1.1) 

where (ip,q,t) € T d x T 1 x T 1 are the angle variables, (I,p) £ R d x R 1 are the action variables and 
fx > is a small real parameter. The Hamiltonian system associated with TL^ writes 

cp = I + fj,d I f ) i = -[id v f, q = p + iid p f, p = sing - fid q f. (<S M ) 
The perturbation / is assumed to be a real trigonometric polynomial of order TV in ip and t, namely^ 

f(I,<p,p,q,t)= Yl fnA^P^y (n - lp+lt) - (1-2) 

\(n,l)\<N 

The unperturbed Hamiltonian system (So) is completely integrable and in particular the energy I 2 / 2 of 
each rotator is a constant of the motion. The problem of Arnold diffusion in this context is whether, for 
/i 7^ 0, there exist motions whose net effect is to transfer 0(l)-energy among the rotators. A natural 

2 f n,l(I i Pi 9) = /— n,— i(^>P> ?) f° r a ^ ( ra )0 E Z d X Z with |(n,Z)| < N where z denotes the complex conjugate of 2 £ C. 



complementary question regards the time of stability (or instability) for the perturbed system: what is 
the minimal time to produce an 0(l)-exchange of energy, if any takes place, among the rotators? 

For simplicity, even if it is not really necessary, we assume / to be a purely spatial perturbation, 
namely f(cp, g, t) — X)o<|(n i)\<n fn,l(o) exp(i(n • ip + It)). The functions f n> i are assumed to be smooth. 

Let us define the "resonant web" T> jy, formed by the frequencies u> "resonant with the perturbation" 
V N := G R d 3(n, I) G Z d+1 s.t. < \(n, l)\ < N and u ■ n + I = j = Uo<\(n,l)\<NE n ,l (1-3) 
where := {u> G R d | u> ■ n + I — 0}. Let us also consider the Poincare-Melnikov primitive 
r(w,0 o ,¥>o):=- / f/(w* + ^, «)(*), t + 5o)-/(wt + ^,0,t + 5 )l dt, 



JR J 

where go (£) = 4 arctan(exp i) is the separatrix of the unperturbed pendulum equation q = sin q satisfying 

<ZO(0) = 7T. 

The next Theorem states that, for any connected component C C 2?jy, u>i,u>p G C, there exists a 
solution of (5^) connecting a 0(/z)-neighborhood of in the action space to a 0(/x)-neighborhood of 
top, in the time- interval Td = 0((l/fx) \ log/i|). 

Theorem 1.1 Let C be a connected component ofT> c N , u)j,ujp G C and let 7 : [0, L] — > C 6e a smooth 
embedding such that 7(0) = Wj and 7(£) = wj?. Assume i/ia< ; /or a// £J := 7(s) (s G [0, L\), T(lo,-,-) 
possesses a non- degenerate local minimum (^o>Vo)- Vn > there exists fj^ = ^0(7,77) > and 

C = C(j) > such that V0 < /i < i/iere exists a solution (/^(t), ip ^(t) , p ^(t) , q^i)) 0/ (S^) and iwo 
instants t% < t 2 such that I^irx) — u>i + O(p), i^(T2) = + 0(/i) and 

|r 2 -Ti|<-|lo gAt |. (1.4) 
A* 

Moreover dist(I M (t), 7([0, L])) < n /or a/Z Ti < i < r 2 . 

In addition, the above result still holds for any perturbation fi(f + /i/) wi£/i any smooth f(ip, q, t). 

We can also build diffusion orbits approaching the boundaries oiT>N at distances as small as a certain 
power of [i: see for a precise statement Theorem |6.1[ 



Theorem 1 . 1 improves the corresponding result in Jl 2| | which enables to connect two frequencies u>j and 
ujf belonging to the same connected component C C T> c Ni for N\ — 14dN and with dist{{a;/, uip},!)^} = 
0(1). Such restrictions of |12] in connecting the action space through diffusion orbits arise because 



transition chains could not exist in all C C T> C N (see remark 2.2). Unlikely our method enables to show 
up Arnold diffusion between any two frequencies uii, ujp G C C T> C N and along any path, since it does not 
require the e xist ence of chains of true hyperbolic tori of (S^). 



Theorem IT also improves the known estimates on the diffusion time. The first estimate obtained 
by geometrical method in 12 , is Td = 0(exp (1//J, 2 )). In jij|-[[l3)-(l4[], still by geometrical methods, and 



in H, by means of Mather's theory, the diffusion time has been proved to be just polynomially long in 
the splitting /z (the splitting angles between the perturbed stable and unstable manifolds W^iXIf) at a 
homoclinic point are, by classical Poincare-Melnikov theory, 0(/i)). We note that the variational method 
proposed by Bessi in Jjj had already given, in the case of perturbations preserving all the unperturbed 
tori, the diffusion time estimate Td — 0(1/ /j, 2 ). For isochronous systems the estimate on the diffusion time 
Td = 0((l/fi)\ ln/z|) has already been obtained in Very recently, in 14 , the diffusion time (in the 



non isochronous case) has been estimated as Td = 0((l//i) \ log yu.|) by a method which uses "hyperbolic 
periodic orbits" ; however the result of jl4| is of local nature: the previous estimate holds only for diffusion 
orbits shadowing a transition chain close to some torus run with diophantine flow. 

Our next statement (a stability result) concludes this quest for the minimal diffusion time T^: it shows 
the optimality of our estimate Td = 0((1///)| log/x|). 



Theorem 1.2 Let f(I, ip,p, q, t) be as in (l.i), where the f n> i (\(n,l)\ < N) are analytic functions. Then 
Vac, r,r > there exist [1i,kq > such that VO < /i < for any solution (I(t), tp(t),p(t), q(t)) of (S^) 
with \I(0)\ < r and \p(0)\ < r, there results 

\I(t) - J(0)| < k Vf such that \t\<— In-. (1.5) 



Actually the proof of Theorem 1.2 contains much more information: in particular the stability time 
dl.5| ) is sharp only for orbits lying close to the separatrices. On the other hand the orbits lying far away 
from the separatrices are much more stable, namely exponentially stable in time according to Nekhoroshev 



type time estimates, see (7.4) and (7.11). Indeed the diffusion orbit of Theorem 1.1 is found close to some 
pseudo-diffusion orbit whose (q,p) variables move along the separatrices of the pendulum. 

As a byproduct of the techniques developed in this paper we have the following result (proved in 
section 6) concerning "Arnold's example" Q where 7^ := {7 = ui, tp € T d ,p = q = 0} are, for all u> € R d , 
even for /j ^ 0, invariant tori of (Sn). 

Theorem 1.3 Let f((p,q,t) := (1 — cosq)f(np,t). Assume that for some smooth embedding 7 : [0, L] — > 
R d , with 7(0) = u>i and j(L) — ujp, := 7(5) (s G [0,£]), T(ui,-,-) possesses a non- degenerate local 
minimum (9q , ifg). ThenWr] > there exists fio — ^o(7) v) > 0, and C = Ci^f) > such thatVO < /i < /i 
there exists a heteroclinic orbit (rj-close to connecting the invariant tori T UI and 7^, F . Moreover the 
diffusion time needed to go from a ^.-neighbourhood of r T UJl to a ^-neighbourhood ofT^ F is bounded by 
(C//x)| log/x| for some constant C . 

The method of proof of Theorem |l.l| (and Theorem 1.2) relies on a finite dimensional reduction of 
Lyapunov-Schmidt type, variational in nature, introduced in jl] and later extended in and [pi to 



the problem of Arnold diffusion. The diffusion orbit of Theorem 1.1 is found as a local minimum of the 
action functional close to some pseudo-diffusion orbit whose (p, q) variables move along the separatrices 
of the pendulum. The pseudo-diffusion orbits, constructed by the Implicit Function Theorem, are true 
solutions of (<S„) except possibly at some instants 9 il for i — 1, . . . , k, when they are glued continuously 
at the section {q — n, mod 2nZ} but the speeds (0 jLl (^), q^{0i)) = (1^(0^, p^Oi)) may have a jump. The 
time interval T s — 9i + \ — 6i is heuristically the time required to perform a single transition during which 
the rotators can exchange 0(/i)-energy, i.e. the action variables vary of O(fi). During each transition we 
can exchange only (3(/i)-energy because the Melnikov contribution in the perturbed functional is 0(/i). 
Hence in order to exchange O(l) energy the number of transitions required will be k = 0(1/ (i). 

We underline that the question of finding the optimal time and the mechanism for which we can avoid 
the construction of transition chains of tori are deeply connected. Indeed the main reason for which 
our drifting technique avoids the construction of KAM tori is the following one: if the time to perform 
a simple transition T s is, say, just T s = 0(|ln/z|) then, on such "short" time intervals, it is valid to 
approximate the pseudo diffusion orbits with unperturbed solutions living on the stable and unstable 
manifolds of the unperturbed tori W S (T U ) = W U (T U ) = {I = u,cp £ T d ,p 2 /2 + (cosq- 1) = 0}, when 
computing the value of the action functional. In this way we do not need to construct the true hyperbolic 
tori Tff (actually for our approximation we only need the time for a single transition to be T s « l//i). 

The fact that it is possible to perform a single transition in a very short time interval like T s = 0(\ ln/i|) 
is not obvius at all. In the time to perform a single transition, in the example of Arnold, is 0(1/ /i). This 
transition time arises in order to ensure that the variations of the kinetic part of the action functional 
associated with the rotators are small compared with the (positive definite) second derivative of the 
Poincare-Melnikov primitive at its minimum point. Unfortunately this time is too long to use a simple 
approximation of the functional. The key observation that enables us to perform a single transition 
in a very short time interval concerns the behaviour of the "gradient flow" of the unperturbed action 
functional of the rotators. This implies a sort of a-priori estimate satisfied by the minimal diffusion orbits, 



see remark 6.1. We think that estimate ( 3. IS ) is interesting in itself. In this way we can show that the 
variations of the action of the rotators are small enough, even on time intervals T s << l//i, and do not 
"destroy" the minimum of the Poincare-Melnikov primitive. 



When trying to build a pseudo-diffusion orbit which performs single transitions in very short time 
intervals we encounter another difficulty linked with the ergodization time. The time to perform a single 
transition T s must be long enough to settle, at each instant 9i, the projection (6i,cpi) of the pseudo- 
orbit on the torus T d+1 sufficiently close to the minimum of the Poincare-Melnikov function, i.e. the 
homoclinic point (in our method it is sufficient to arrive just 0(l)-close, independently of /i, to the 
homoclinic point). This necessary request creates some difficulty since our pseudo-diffusion orbit may 
arrive 0(/x)-close in the action space to resonant hyperplanes of frequencies whose linear flow does not 
provide a dense enough net of the torus. The way in which this problem is overcome is discussed in 
section |^: we observe a phenomenon of "stabilization close to resonances" which forces the time for some 
single transitions to increase. Anywa y the total time required to cross these (finite number of) resonances 
is still Td = 0((l//x) log(l//z)), see ( [5.13 ) and the proof of Theorem [0]. This discussion enables us to 



prove optimal fast- Arnold diffusion in large regions of the phase space and allows to improve the local 
diffusion results of Q. 

We need therefore some results on the ergodization time of the torus for linear flows possibly resonant 
but only at a "sufficiently high order" . We present these results in section 0. We point out that the main 



result of this section, Theorem |4.2| , implies as corollaries Theorems B and D of see remark 4.1. It is 
of independent interest and could possibly improve the other results of 

This work is a further step of a reaserch line, started in and J6|, for finding new mechanisms to 

prove Arnold diffusion. We expect that the variational method developed in this paper could be suitably 
refined in order to prove the existence of drifting orbits in the whole action space and then to prove such 
results for generic analytic perturbations too. Another possible application of these methods could regard 
infinite dimensional Hamiltonian systems where the existence of "transition chains of infinite dimensional 
hyperbolic tori" is quite far for being proved. 

The paper is organized as follows: in section 2 we perform the finite dimensional reduction and we 
define the variational setting. In section 3 we provide a suitable development of the reduced action 
functional. In section 4 we prove the new results on the ergodization time. In section 5 wc define the 
unperturbed pseudo-orbit. In section 6 we prove the existence of the diffusion orbit. In section 7 we 
prove the stability result, that is to say the optimality of our diffusion time. 

Notations: Through this paper the notation a(zi, . . . , Zk) — 0(6(/i)) will mean that, for a suitable 
positive constant C(-f, f) > 0, |o(zi, . . . , z p )\ < C(j, /)|6(/x)|. 



2 The variational setting and the finite dimensional reduction 

When the perturbation f(ip, q, t) — Yl\(n i)\<N fn,i{q) ex p(K n ' ¥ + ^)) i s purely spatial, f\ system (S^) 
reduces to the second order system 

<p = -(j,d v f(<p,q,t), -q + sinq = \i d q f{ip,q,t) (2.1) 

with associated Lagrangian 

Cp{<p,frq,q,t) = — + — + {1 - cosq) - fj,f((p,q,t). (2.2) 



Using the Contraction Mapping Theorem we will prove in lemma 2.1 that, near the unperturbed solutions 
(u>(t — 9) + ipo, <7o(t — 9)) living on the stable and unstable manifolds of the unperturbed tori 7L, there 
exist, for /x small enough, solutions of the perturbed system ( |2.1[ ) which connect the sections {<p = ip + , q = 
— 7r, t = 6 + } and {(p — Lp~ , q = n, t = 8~} (under some assumptions). The diffusion orbit will be a chain 
of such connecting orbits. 

We first introduce a few definitions and notations. For A := (6 + , 9~ , tp + , (p~ ) 6 R 2 x R 2d with 



3 We will develop all tl 
see the proof of Theorem ! 



^computations for /. All the next arguments remain unchanged if the perturbation is / + fif, 
1.1. 



9 + < 9 we define T\ := 9 -9+ and the "mean frequency" lo\ £ R d as lo\ := ^— The "small 

9 — 9 + 

denominator" of a frequency lo £ R d is defined by 

f3(co) := /3 n (lo) := min |n-« + i|. (2.3) 

0<|(n,i)|<AT 



/?((*;) measures how close the frequency lo lies to the resonant web V m defined in ( |l.3|) . We use the 
abbreviation [3\ for (3{lo\). We shall always assume through this paper that lo stays in a fixed bounded 
set containing the curve 7. 

For T large enough, there exists a unique T-periodic solution Qt of the pendulum equation, of small 
positive energy with Q T (0) = -tt, Qt{T) = tt. Moreover Q T satisfies V< G [0,T/2) U (T/2,T], 

|drQr(*)| < K ie ~ K ^ T ^ , |9 T (Q T (T - -))(t)\ < 

and 

|Qr(*)-9oo(t)| + |QT(t)-goo(t)| <K ie - K - T , \Q T (t)\ a^axfe-^.e-^f 7 -"}, (2.4) 
for some positive constants _ftTi and K2, where goo is defined by 

qoo(t) = gb(i) - 2tt if t G [0, T/2), Qoo {t) = q (t - T) if t G (T/2,T]. 

Lemma 2.1 There exists (12 > and constants Cq,C\,c, c\ > swc/i i/iai V0 < /i < /12, VA = 
(# + , (/? + , 9?~) suc/i £/ia£ Co/3 2 > /U and Ci|ln/i| < T\ < Cof3\/fj, there exists a unique solution 
(<£>/*(*)> := (w.a(*)?9/*,a(*)) °f (PI )> defined for t £ (9+ - 1,6~ + 1), satisfying (p^{9 ± ) = ip ± , 



q f2 (9 ± ) = =p7r and 

(i) |^(t) - Tp(t)\ < c/x(l + ciM^ 2 )//? 2 ., \4>„{t) -u\< cn/P x , 
(m) !?„(*) -Q^(f-0+)|<<^, |^(<)-Q Ta (<-0+)| <c M , 

where 7p(t) := cja(^ — # + ) + </? + - Moreover (Pn t \{t), Lp^\(t), q^,\{t) and q^,\{t) are C 1 functions of (t, A). 



The proof of lemma 2.1 is given in the Appendix. 



Remark 2.1 Roughly, the meaning of the above estimates is the following. 



1) We have imposed C\\ ln/i| < T\ := 9~ — 9 + so that by (2.4), on such intervals of time, the periodic 
solution Qt x is O(fi) close to "separatrices" q^ of the unperturbed pendulum. 

2) Estimate (ii) implies that for t w (9 + +9~)/2 the perturbed solution g M may have 0([i) oscillations 
around the unstable equilibrium of the pendulum q = 0, mod 2ir, which is exactly what one expects 
perturbing with a general f . On the contrary for the class of perturbations considered in J|/ as /(<£>, g, t) = 
(1 — cos q)f(ip,t) preserving all the invariant tori, estimate (ii) can be improved, getting max{|g At (t)— 
Q Tx (t - 9+)\,\q^(t) - Q Tx {t - 9+)\} = 0(/zmax{exp(-C|t - 9+\), exp(-Cjt - 9~\)}). 

3) For (3\ rs yjji estimate (i) becomes meaningless: for a mean frequency lo\ such that n ■ lo\ + 1 s» JJi 
for some < |(n, l)\ < N the perturbed transition orbits tp^ are no more well-approximated by the straight 
lines Tp(t) := ip + + u)\(t — 9 + ). 

Remark 2.2 Let us define V% := {lo £ R d \ \lo ■ n + l\ > [3, V < \(n,l)\ < N}. In Efti it is 



N 

proved that hyperbolic invariant tori TJf of system (S^) exist for Diophantine frequencies lo £ T>^f , for 
some Pi = 0(1) and some N\ — O(dN) > N , namely avoiding more "resonances with the trigonometric 
polynomial f" than just N. The presence of such "resonant hyperplanes E n> i " for N < \(n, l)\ < N± may 
be reflected in estimate ( i) b y the term fiT% . However such term, for our purposes, can be ignored. From 



this point of view lemma 2. 1 could perhaps be interpreted as the first iterative step for looking at invariant 



hyperbolic tori in the perturbed system bifurcating from the unperturbed one 's ■ 



By lemma 2.1, for < /i < /12, we can define on the set 

A M :={A=(0+,0-,^-) J Go/? 2 >n, Ci I In /i| <T\ < ~~~ } > 
the Lagrangian action functional G„ : A p — > R as 



G M (A)=G,,(0+0-, / C^{t),ip M (t), qil (t),q^t),t) dt. 



(2.6) 



We have 

Lemma 2.2 G M is differ entiable and (with the abbreviations tp,q for <Py.,qn) 

V^+G^A) = -0(0+), a e+ G M (A) - \w(9+)\ 2 + ig 2 (0+) + cosg(0+) - 1 + tt, 0+) 

V^-G M (A) = 0(0"), a e -G M (A) = -(i|^-)| 2 + iq 2 (^) +cosg(0-) - 1 + p/frr, tt, 0")). 



Proof. By lemma 2.1 the map (A, t) 1— * (<Pfj,,x(t),(p fl! x(t),q IMt x(t),q f j lj x(t}) is G 1 on the set {(A,t) € 
A,, x R I 0+ < i < 9~}. Hence G p is differentiable and 

cVG M (A) = -£^+^(e+),-TT, q (e+),e+)+ [ <p( 8 ) -d e+ <p{s) + q{s)d e+q {s) d s 

smq(s)d g +q(s) - fid lp f(<p{s),q{s),s) ■ d g +ip(s) - fid q f(tp(s), q(s), s)d g +q(s) ds. 
Integrating by parts and using that (q^.A, ¥V,a) satisfies (2.1) in (9 + ,6~), we obtain 



dg+G^X) = <p(6 + ), -tt, q(9+),9+) + [q(s)d e +q(s) + cp(s) ■ d e+ tp{s) 

Now q^\{6+) = -tt for all A hence q{9+) + d e+ q(6+) = 0. Similarly we get tp(6+) + d g+ ip{9+) = 0, 
dg+q(9~) = 0, d e +<p(9~) — 0. As a consequence 

de + G,(X) = \\v\ 2 {9+) + l -e{9+) + (cosq(9+) 1) + tt, 9+). 

The other partial derivatives are computed in the same way. ■ 

For P > fixed, denoting Ai = (0,, 0j+i, 92,, Vi+i), we define on the set 



A fi t) ft W )6R*xR M I Vl<t<fc-1, A, G A M , /3 Xt >/?}, 

the reduced action functional : A Mi fc — > R as 

I-/I 2 



i=l 



WFVfc H — fc + fiT s (uj F ,9k,^k) - fiF(u F , Ok, <Pk) 



where 



r>,0 o ,¥>o) : 
r>,0 o ,¥>o) ■-- 



oo 

-00 r 



/(wf + po, 9o W, t + O ) - f(ut + (f , 0, t + 6»o)) 
f(ut + <po, Qo(t),t + 9 Q ) - f(wt + <p , 0, t + do)) 



dt, 



dt, 



(2.7) 
(2.8) 



are called resp. the unstable and the stable Poincare-Melnikov primitive, and 

e i(n-ip +l6 ) 

fn,t 

0<\{n,l)\<N 



F(u,9 ,<po):=-fo,a9 - V f n ,i^ -77, (2.9) 

z — ' i(n ■ d + I) 



fn,l '■— fn,i(0) being the Fourier coefhcients of f((p,0,t). 

Critical points of the "reduced action functional" J- "„ give rise to diffusion orbits whose action vari- 
ables I go from a small neighbourhood of ojj to a small neighbourhood of ujp, as stated in lemma 2.3 

I |2 I |2 

below. The "boundary terms" wxfi — 2~9i + (jF u {lji, Oi,ipi) + (j,F(lji, 9\, ipi) and —ujpipk + 2 ® k + 
)j,T s (ujf, 9k, (pk) ~ (J-F(uip, 9k, fk) have been added also to enable us to find critical points of w.r.t. all 
the variables (including 9\,p\,9k,^Ph)- 

More precisely, for A = (9,p) G ^n,k we define the pseudo diffusion solutions (w,a, Q(j.,\) on the 
interval [9\, 9k] by 



(^,A(*),<k,A(t)) :=(VM,A*(*),«M,Ai(*)+27r(*-l)) for *G[« 



where (^,Ai (i)> ?ju,A» (*)) are given by lemma |2.l| . The pseudo diffusion solutions ((/^A) <7^,a) are then 
continuous functions which are true solutions of the equations of motion (2.1) on each interval (#j,#j_|_i), 
but the time derivatives (<Pn,\, ^a) may undergo a jump at time 0j. We have 



Lemma 2.3 If X — (9,tp) G A^.fc is a critical point of T^, then (ip ~(t),q ~(t)) is a solution of 
in the time interval (6*i, 9k)- Moreover ^(#1) = Wj + 0(/i), ip^k) = ujp + 0(/i), i.e. (cp ~,q ~) is a 
fusion orbit between luj and top with diffusion time Td — \9k — 9\\. 



PROOF. By lemma |2_2| if V^JF^A) = 0, then for 2 < i < k - 1, (p ^(0r) = tp ^(9+) and p ~(0i) = 
+ 00), ip j(9 k ) =oj f + 00). Moreover, if V^-F^A) = and 9^^>(A) = then (for 2 < i < fc - 2), 
<f~(0+) = g 2 ~(0r). Now, by lemma [0| and Q, 9 = <fo(0) + O(/i). Hence q ~{9+) = q ~(0r) 

and the proof is complete. ■ 



3 The approximation of the reduced functional 



In order to prove the existence of critical points of the reduced action functional T ^ thanks to the 
propert ies of the Poincare-Melnikov primitives T(lu,-,-) we need an appropriate expression of J 7 ^, see 



lemma 3.5. We shall express as the sum of a function whose definition contains the T(u>, ■, •) (for 
which we can prove the existence of critical points) and of a remainder whose derivatives are so small 
that it cannot destroy the critical points of the first function. 

The first lemma gives an approximation of G M (defined in fl2.6|) ) . 

Lemma 3.1 For < \l < fx^, for A G A M we have 

G M (A) = \ ^ e S~X+) + Mr>A ' 9+ > ^ +) + /^"("A, V") - f* J 0, t) dt + i?o0, A) (3.1) 

where 



Va-Ro 



PROOF. By lemma we can write W,a(*) = + v ^( t )> 9^a(*) = Qr x {t - 9+) + „,,.„ 

^a(#+) = ^,a(^) = 0, IKa|U»(0+,0-) = 00/^a), ||« M ,xlU-. (fl +, fl - ) - O(0//35)(l + M T A 2 )) 
wV,a(# + ) = w^\{9 ) = 0, \\w^x\\L°°(9+,e-) + lk/i,Alk~(e+,e-) = OO) 



(3.2) 

Wn,\(t), where 
and 



In the following, in order to avoid cumbersome notation, we shall use the abbreviations v,w,Q for 
v fj.,\j w fj.,\i Qt x {' ~~ the dependency w.r.t. A and \i being implicit. We have 



G M (A) 



+ \\W)\ 2 +W) ■ m + \W)\ 2 + \Q 2 {t) + Q(t)w(t) + \w\t) 

0- 

[1 - oos(Q(t) + w{t))] - fJif{Tp{t) + v(t), Q(t) + w(t), t) dt. 



Now since v( 



9+ 

where 



Q(t)w{t) dt = 



") = and w(6+) = w(0-) = 0, / lp(t) ■ v(t) dt = w A • v(t) dt = and 

Je+ J0+ 

- .. r°- 
-Q{t)w{t) dt = / -(smQ(t))w(t) dt. As a result, G M (A) = G°(X) + i?i(A), 
J0+ 

G°(A) = 



\\M 2 + \Q 2 + (1 " cos Q) - tf(lp, Q, t), 
+ z z 



f 1 1 

RiW = / t:\v\ 2 + o^ 2 + (cosQ - cos(Q +w) - wsinQ) - [if(Tp + v,Q + w,t) + fif(ip,Q,t). 
J0+ z Z 

We shall first prove that | Vi?i | = o( - ^^T^ Ta) . We have d e +Ri = r s l + r 2 + r 3 + r 4 + r 5 + r 6 where 



r-2 



+ etc 

w — (dg+w) + sin(Q + u>) — sinQ — ^d q f{Tp + v, Q + w. t) (dg+w), 
+ dt L J 



r3 

r 4 := /i 



r 

/ (— sin Q + sm(Q + w) — w cos Q)dg+ Q, 
J0+ 

d ip f(p,Q,t)-d v f(Tp + v,Q + w,t) -dg+Tp, 



r 5 := fi 



re 



9gf(<P,Q,t) - d q f(ip + V,Q + W,t) dg+Q 



-w( 



q+\2 



Now v and w satisfy 

-v(t) = fid v f(^(t)+v(t),Q(t)+w(t),t) 
-w(t)+sin(Q(t)+w(t)) = fidJ(<p(t)+v(t),Q(t)+w(t),t)+sinQ(t). 

Moreover, deriving w.r.t. 9 + the equality v(9 + ) = we obtain that (dg+v)(9 + ) = —v(8 + ). Similarly 
(dg+w)(9 + ) — —w(8 + ), (dg+v)(9~) = and (dg+w)(9~) = 0. Therefore an integration by parts gives 
n = h>(6>+)| 2 , r 2 = w(8+) 2 hence |n| + \r 2 \ = 0(ti 2 //3 2 ). 

By the properties of Qt, dg+Q is bounded in the interval [# + ,# - ] by a constant independent of A. 
Moreover - sin Q(t) +sin(Q(t) + w(t)) - w(t) cos Q(t) = 0(w(t) 2 ). Therefore r 3 = 0(^ 2 T). 

We have also, for some positive constant c, 



K I + |r 5 | < C/J.T 



sup |a fl+ Q(*)| + |a e+ ^)| sup (|«(*)| + K*)|) 

te[6+,e-] J L te[e+,6»-] 



Since dg+ip is bounded independently of A, we have by lemma 2.1 |r 4 | 
lemma 2.1, ?*6 = 0(/i 2 //9 2 ). The estimate of the other derivatives of i?i is obtained in the same way. 



_n/ ^ (kHfZkl r A ). Still by 



indent We now develop G°(A) as 



G° (A) = \ ^ _^' } 2 + fiT s {u>x ,e + , <p + ) + hT u (cj x ,6- , <p~ ) - /x J* ffflt) , 0, t) dt + R 2 (A) + R 3 (A) , 



where 



i? 2 (A) 



ig 2 (t) + (i-c 0S g(i)) dt 
+ ^ 



1 . 



2 T (t) + {1- cos Q Tx {t)) dt, 



(3.3) 



fls(A) 



(/(?(*), Q(t),t) - f(Tp(t), 0, t) dt - [iT s (cj x ,e+, <p+) - ^T u (io x , 8-,<p-). 



There remains to prove estimate ( |3 . 2[ ) for Vi? 2 and Vi? 3 . By (3.3) <9 v ±i? 2 = and 9 6) +i? 2 (A) = — dg-R^X) 
is the energy of the TVperiodic solution Qt x of the pendulum equation. Now this energy is 0(e~ C2Tx ). 
Hence (provided C\ is large enough) |Vi^(A)| = 0(fi 2 ). 

In order to estimate the derivatives of R3, let us define g(<p, q, t) := f(<p, q, t) — f(cp, 0, t). We have 



where 



03(A) 



-t*g(Tp(t),Q{t), t) dt - fiT s (uj x , d + ,<p+) - nT u (io x , 0-,<p-) = mMA) + 63(A)) 



T A /2 roc 

g(LU X t + ^ + ,Q Tx {t),t + + ) dt+ / g(u) X t + (p + ,q {t),t + 9 + ) dt, 
Jo 



r pO 

63(A) :=-/ g{ui x t + <p-,Q Tx (t + T x ),t + 6-) dt+ g{u x t + <f ,q {t),t + 9~) dt. 

-T x /2 J -00 



We have 



03(A) = - 



Ta/2 r 



o(wAt+y + ,QT,(t),t+e + )-.g(^Ai+^ + ,goW,t+e + ) + / g{oj x t+ v > + ,q (t),t+8+). 

T x /2 



Recalling that sup t6(0 T/2 -, \d T QT{t)\ = 0(e~ C2T ), sup te(0 T/2) |Qr(t) - ?o(t)| = 0(e~ C2T ), it is easy to 
see that the derivatives of the first integral are 0(T x e~ C2Tx ) — O(p) (still provided C\ is large enough). 
Moreover, using that (\g(u} X t+tp+ , q (t), t)\ + \d ip g{uj x t+ip + , q (t),t)\ + \d t g(uj x t+ip + , q (t),t)\) = O(q {t)- 
2n) = 0(e~ C2t ) for t £ (Tx/2, +00), we find that the derivatives of the second integral are O(n) as well. 
Hence |Vd3(A)| = 0(/i). The same estimate holds for 63. We then conclude that Vi?3(A) = 0(/i 2 ), which 



completes the proof of lemma 3.1 



In section || we will look for a critical point of in the set 

E := {X=(0i,...,e k ,<p 1 ,...,<p k ) £ R fc xR fcd J 9 i = 9 i + b i , + \k\ < 2ir, \a t \ < 2vr}, (3.4) 

where k,Tp i , 9i will be defined in section H It will result that E C A M) & (for some (3 > depending on the 
curve 7). In particular, for all A £ E 



Ci|ln/i| < 9 i+1 -9 t < —, V* = 1, - 1, 



(3.5) 



where $ := (3 Xi := /?(u>j) and u>i := cl% := {fi+i — <Pi)/{9i+i ~~ Moreover we will assume (see (|5.8|)) 



— Wj| < p/i where 



(1 < i < k — 1), ujq :— lji, LUk := (3-6) 



and p > is a small constant to be chosen later (see (5.3)). For the time being, assuming (3.5) and p.q ), 
we want to give a suitable expression of in E. By lemma 3.1, for A G E, we have 



t m ^ 1 - jgj | 2 , M 2 fl , M 2 , 



+ ^(F^-i, fli, ^) + r-fwi, Oi, w)) + pF(u)i, U <pi) (3.7) 

i=l 

- /M*-0;) + ^,O,*) dt-/iF(wF,0fe,^fc)+^i?o(At,Ai), 

i=l i=l 



where | V,\-Ro(/i, A)| satisfies (3.2). We shall write ^ in an appropriate form thanks to the following 
lemmas. The first one says how close the "mean frequencies" w,; are to the unperturbed 57j. 

Lemma 3.2 Let A = (#i, . . . , 6k, <pi, ■ ■ ■ , fk) belong to E. Then 
Moreover 

r u (u i - 1 ,e i ,<p i )+r a (u i ,6 i ,<p i ) = r(LJ i ,6 i ,<p i )+R A (\ i ), where V# 4 = 0(1/| ln/i|). (3.9) 

Proof. Set A#i := 9i + \ — 9i, Acii :— a.; + i — a, and Ao^ := — 6.;. By an elementary computation we 
get uii — Ui — —ZJiAbi/ A9i + Aa^/A^. By the definition of E and (3.5), estimate ( |3.8| ) follows. 

From the definition of T u , V s and the exponential decay of go it results that d u) T u ' a is bounded by a 



uniform constant, as well as its partial derivatives. Hence ( p.9[ ) is a straightforward consequence of (3. 
and of (Eel). ■ 



Lemma 3.3 For < /i < /14 



/ + /(wi(t-e0 + w.0,i) ^-M-P(^^fe^fe) = X]^5(M,Ai-i,A i ), (3.10) 
i=i i=i 

where, for all i 
Proof. We have 

•Ot+i 

f(tpi +uji(t- 6i),0,t) dt = F(wi,8 i+ i,ipi +1 ) - F(ui,6i,ipi) 

= (^F(uji, 6 i+ i,ip i+1 ) - F(u>i-i,9i,(pi) \ + [F(uJi-i,6i, tpi) - F{ui,Q h <^)) , 
where F(cu, •, •) is defined in ( |2.9| ). We obtain 

^(wj.flx,^)-^/! / f{<pi + Wi{t-6i),Q,t)dt-vF{w F ,6 k ,<p k )=^Ri 

1=1 i=l 

where 

i?5 := i?5(^, 0i-i 5 Vi-i, 6i, tpi, 9 l+ i,ip l+1 ) := fi(F(uJi-x,6i, tpi) - F(ui,8i, ipM 

e i(n-<pi+Wi) . y 1 



^ ^ n ' 1 i V ( // • jj,. 1 + / ) ( ;/ • - / i 

0<|(n,0|<iV V V ; 



*In the cases i = 1, i = k we only have i?g = iijt ([i, 6i,<pi,02,<P2) and _Rg = (/i, 9k—i,>fik—i,9k, l Pk)- 



Now we prove (3.11). Let us consider for example dg^l- We have 



-Wi-l 



d u F((A}i,6i,tpi).- 



UJ, 



0<\(n,l)\<N 



(n ■ u>i-i + I) (n ■ u>i + I) 



where 



E ne i(n-p +W a ) 
fn,l~, 
i(n • ( 

0<\(n,l)\<N 



i(n • lu + I) 2 ' 



(3.12) 



(3.13) 



Estimate ( |3.11 ) follows immediately from ( 3.12 ) and ( |3.13j ). The other partial derivatives of R\ can be 
estimated similarly. ■ 

Finally, to get a suitable expression of T^, we find convenient to introduce coordinates (6, c) S R( 1 + d )' £ 
defined by (3.4) and 



Ci = a, - uj t bi 



Vi = 1, . . . , k, 



(3.14) 



(we are just performing a linear change of coordinates adapted to the direction of the unperturbed flow 
at each i-transition (hi, a^) = 6j(l,5Jj) + (0, Ci)). 



Lemma 3.4 We have 



k-l 

i=l 



1 - <fif 



i+1 



fe-1 



\Ci+l - c,| 5 



(3.15) 



2^A8 l + (b i+1 - h) 
fc 

i=l 

where A6i := — 0j ancQ 

V^(/i,^_i,^_i,^,^,^ + i,^ +1 ) = 0(As?i) = 0(p/x). (3.16) 

Proof. Let {7i}i=i,...,fc-i be defined by ifi+i — <fi = Ui(9i+i — 9i) + ji. We can write u>i(pi — LUF^fik as 

fc-i 



^ {{pi-i - Ui)ipi - uJi((fi+i - <fii)j + ip k (uJk-i - Wf) 
i=i 
fe-i 

^ ((pi-i - Ui)ipi - \uji\ 2 (9 i+ i - 6i) - u^A + ip k (uj k -i - Wi?). (3.17) 



We can also write 



A— 1 



: «* = E(( 



k-l 
i=l 



o )°* + ^ 



k-l 



'i+1 — "i, 



\lo f \ 2 \u k -l\ 2 



1 \<£j+l - ifj 

2 (8 i+1 -6i) ^2 



i=l 



1 N 2 

2 (fli+i - i 



0k, (3.18) 
(3.19) 



Summing (|3~18|) and |yj|) we get 

^ 1 bi+l ~ ^:| 2 



7t+l 



+ UJlfi - iO F <fk 



2 yfc -Z.2(^ +1 - ( 



5 For i = k we have ijg = ^Jfc). 



— '\uJi\ 2 \LUi-l\ 2 



1=1 



\UJ F \ 2 |Wfe-l| 2 



(3.20) 



Substituting ip i + a% for tp^ and &i + 6; for 0,, we get 7^ = (flj+i — a,) — w;(&i + i — Moreover the non 
constant terms in the right handside of (3.20) (i.e. those depending on dj,&j) are the first one and 

\Uj,\ 2 \uJi-l\ 2 



»=i 



6 t =:^i? i (M,^,^) 



i=i 



with Vi?'(/i, 6*i, = O(AcJj). Finally, expr essin g 7.; in terms of Cj) we get 7, = (oj+ i — dj) — a->i(&,; + i — 
&i) = (cj+i ~ Cj) + ^i+i A(Uj and then from ( 3.20[) , developing the square, we get ( |3.16| ). ■ 

From ( ft.7| ) lemmas 3.2, 3.3 and 3.4 we obtain the expression of in the new coordinates (b, c) 
required to apply the variational argument of section ^|. 

Lemma 3.5 There exists 115, C2 > such that V < fi < fi§, if 

13, > C 2 max { » 1/2 (6 i+1 - 6i) 1/2 , (i(6 i+1 - ^) 3/2 , (9 l+1 - O^ 2 } (3.21) 

then 



1 1 I 12 fe 

9 E at? r ^ + My;^^ + & if ^ + ^ + ^ + j^, C ), (3.22) 



R 7 {b, c) := }^ R\{^, h-i, Cj-i, h, Cj) h+i, Cj+i), 



|Vi^| < c 2W . 



(3.23) 
(3.24) 
(3.25) 



Proof. It is easy to see that (3.6), ( p.q ) and ( 3.21 ) imply (provided is small enough) that 

%i < ft < 2&-i, |A - ft-il = o( 1 + 1 +/A 

Noting that <9 C , = 9 Vj and d 6i = Zj 4 <9 Vi + <9e s , estimate (p~24|) follows from (fj), (|J), (|3~Tl|) , (p~25|) and 
S). ■ 



4 Ergodization times 

In order to define Tp i: 0i (1 < i < k) we need some results, stated in this section, on the ergodization time 
of the torus T l := H L /Z l for linear flows possibly resonant but only at a "sufficiently high level", 

Let fi £ R'; it is well known that, if il ■ p ^ 0, Vp G Z z \ {0}, then the trajectories of the linear flow 
{ilt + A} t £R are dense on T' for any initial point .4 e T ! . It is also intuitively clear that the trajectories 
of the linear flow {ilt + A} t ^R will make an arbitrarly fine 5-net (S > 0) if f2 is resonant only at a 
sufficiently high level, namely if f2 • p ^ 0, Vp £ Z l with < \p\ < M(S) for some large enough M(8). Let 
us make more precise and quantitative these considerations. 

For any 17 g R' define the ergodization time T(f2, S) required to fill within 6 > as 

T(Cl, S) = inf {i £ R+ Vx £ R', d(x, A + [0, t]fi + Z l ) < s\, 

where d is the Euclidean distance and A some point of R z . T(fi, (5) is clearly independent of the choice 
of A. Above and in what follows, inf E is equal to +oo if E is empty. For R > let 



a(Q,R) =inf {|p-Q| peZ 1 , p^0,|p|<i?}. 



3 In the cases i = 1, i = k we have = R^(fJ,, 9i,ifii,02,<P2) and _R* = (/i, 8k-i, tpk—l, 0k, <Pk) 



Theorem 4.1 Vi £ N there exists a positive constant ai such that, VO G R', V<5 > 0, T(fl,5) < 
(ai^ai/S))- 1 . Moreover T(fi, S) > (l/4)a(0, 1/4S)" 1 . 

In the above Theorem a -1 is equal to if a = +oo and to +oo if a = 0. 

Remark 4.1 Assume that 0, is a C-t Diophantine vector, i.e. there exist C > and r > I — 1 swc/i that 
Vfc G Z z |fc • n\ > C/\k\ T . Then a(Q,R) > C/R T and so T(fi,tf) < aJ/CS T . This estimate was proved in 
Theorem D of UW. Also Theorem B of /pi|/ is an easy consequence of Theorem 4.1. 



Theorem 4.1 is a direct consequence of more general statements, see Theorem 4.2 and remark 4.2. 
Let us introduce first some notations. Let A be a lattice of R', i.e. a discrete subgroup of R z such that 
R'/A has finite volume. For all Vl G R ( we define 

T(A, n, 5) = inf ji G R+ Vie R* d(x, [0, t]Q + A) < s\ 

(T(A, f2, 5) is the time required to have a <5-net of the torus R'/A endowed with the metric inherited from 
R'). For R > 0, let 

A* = |pGR' VAgA, p-Agz| and A* R ={peA* < \p\ < R,\ 

(A* is a lattice of R' which is conjugated to A). We define 

a(A,0,i?) =inf ||p-n| pGA^}. 

The following result holds: 

Theorem 4.2 VZ G N there exists a positive constant a/ such that, for all lattice A of R'. Vfi G R ; , 
V<5 > 0, T(A, Q, S) < (a(A, O, aJS))- 1 . 

Remark 4.2 It is fairly obvious that T(A, O, S) > (l/4)a(A, f2, 1/4£) _1 . Indeed, assume that A*, u ^ 
and let p G Ay iS be such that p ■ VI — a :— a{A, il, 1 /4<5) . Let x G R' satisfy p ■ x = 1/2. Then 
Vi G [0,1/40;), VAg A, 

|a:-(tfi + A)| > \P-( x ~ m - X )\ >A5\p-x-tp-n-p-X\, 
\P\ 

and p ■ x - p ■ X £ (1/2) + Z, whereas \tp ■ Q| = ta < 1/4. Hence \x - (t£l + A)| > S. 



In the next section we will apply Theorem 4.1 when £1 = (uj, 1) G R d+1 . The proof of Theorem 4.2 
is given in the Appendix. We could give an explicit expression of a;. However it is not useful for our 
purpose and the constants a\ which can be derived from our proof are certainly far from being optimal. 



5 The unperturbed pseudo-diffusion orbit 

Consider the set Qm of "non-ergodizing frequencies" 

Q M := G R d 3(n, G Z d+1 with < \(n, Z)| < M, and u ■ n + / = oj = |J E h 

hes M 

where S M := {h = (n,l) G (Z d \ {0}) X N | < \h\ < M, h± jh',Wj G Z,h' e(Z d \ {0}) x N} and 
E h = E n> i := {lo G R d | (w, 1) • h = cj ■ n + I = 0}. By Theorem III] (or Theorem |J, with A = 2nZ d+1 ), 
for S > 0, if lo belongs to 



) C M = |w G R d uj-n + l^Q, V0 < |(n,Z)| < mV 



(5.1) 



with M = 8ira,d+i/S, then the flow of (w, 1) provides a 5/4-net of the torus T d+1 . 
Moreover if u Q M then for all (n, I) G Z d \{0} x Z, 

|n • a; + l\ = |n|dist(w, E n ,i) > dist(a>, E n ,i) > dist(w, Q M ) > 0. (5.2) 



By Theorem L_L (or Theorem 4.2), we deduce from (|5.2|) the estimate 



which measures the divergence of the ergodization time T((u>, 1), S) as u> approaches the set Qm- 

Definition 5.1 Given M > 0, a connected component C ofT> c N andu>i,u>p G C, we say that an embedding 
7 € C 2 ([0, L],C) is a QM-admissible connecting curve between ojj and up if the following properties are 
satisfied: 

(a) 7 (0)=w/,7(L)=w Fj | 7 (s)| = 1 VsG (0,£), 

(b) V7i = (n, I) G Vs G [0, L] such that j(s) G E h , n ■ j(s) ^ 0. 

Condition (b) means that for all h G Sm, 7([0, L}) may intersect transversally only. It is easy to 
see that condition (b) implies that = {s £ [0, L] | 7 (s) G Qm} is finite and that there exists v > 
such that for all s G 1(7), for all ft, = (n, i) G Sm such that 7(s) G Eh, \j(s) ■ n\/\n\ > v. 

If a curve a is not admissible we can always find "close to it" an admissible one 7. Indeed the following 
lemma holds. 

Lemma 5.1 Let M > 0, C be a connected component ofT> c N , ujj,u!f G C and let a G C 2 ([0, Lq],C) &e 
an embedding with a(0) — Wj and a(£o) — w f- Then, V 77 > 0, f/iere exists a curve 7, Qm -admissible 
between ujj and top, satisfying dist(7(s), a([0, Lq])) < n, Vs G [0, £]. 

Proof. First it is easy to see that there exists an embedding ct\ : [0,£i] — > C such that a%(0) = 
uJi,cti(Lx) = up, dist(«i(s), a([0, Lo])) < ?7/4 and V h = (n,l) G 5m, ^ (resp. ujp ^ E^) or 
di(0) • n 7^= (resp. di(Li) • n 7^ 0). 

Let r > 0,vi > be such that Vs G [0, r] U [Li - r,L{\, Wi = (n,l) G 5m, dist(ai(s), 2?/,) > ^1 
or |di(s) • n\ > v\. Let (f> : [0, Li] — > [0,1] be a smooth function such that (f)(0) = 4>(L\) — and 
Vs G [r,Li -r] 0(s) = 1. 

We shall prove that for all e > there exists uj e G R d , |w e | < e, such that Vh = (n,l) G 5m, for 
all s G [r, Li — r] such that ai(s) G 25/, + uj e , ai(s) • n 7^ 0. For h = (n,l) G 5a/, let Jh = {s G 
[r, Li - r] I n • di(s) = 0} and 14 = {ai(s) — u | s G Jh, u G 2?/,}. Let ^>/, : [r, Li - r] X 2?/, — > R d be 
defined by iph(s,u) = a±(s) — u. Dtjjh{s,u) is singular iff s G Jh- Therefore Vh is the set of the critical 
values of iph and by Sard's lemma, meas(Vh) = 0. Hence for all e > there exists ui £ G R d such that 
\oj £ \ < e, uo e £ Vh for all h G Sm- Our claim follows. 

Now we can define 012 : [0, L{\ — > C by ci2{s) — Qti(s) — <p(s)uj e . It is easy to check that, provided e is 
small enough, ct2 is an embedding which satisfies condition (6). 7 is obtained from cxi by a simple time 
reparametrization. ■ 

If r(a(s),-,-) possesses, for each s, a non-degenerate local minimum (9^ s \ lpq^), then, by the Im- 
plicit Function Theorem, along any curve 7 sufficiently close to a, T(7(s), •, ■) possesses local minima 

(0Z W ,Vo W ) such that 

D 2 {e ^r( 1 (s),9^ s \rt is) )>Xd, Vsg[0,L], (5.4) 
for some constant A > depending on a. Therefore, by the above lemma, it is enough to prove the 



existence of drifting orbits along admissible curves 7. Property (5.4) will be used in lemma p.l| . 

Given a QM-admissible curve 7, let us call s*,...,s* the elements of T(pf), and uj^ = 7(s*), ■ • • ,u>* = 
7(s*) the corresponding frequencies. Since, Vra = 1 , . . . , r, (0Q m , tp% m ) is a nondegenerate local minimum 
of r(w* n , •, ■), there is a neighborhood W m of u* n such that, Vw G W m , T(uj, ■) admits a nondegenerate 



local minimum (9q,<Pq), the map uj \— > (0qi<Pq) being Lipschitz-continuous on W m . Therefore we shall 
assume without loss of generality that for all m = 1, . . . , r, 

V(cV)e(W m n 7 ([0,L])) 2 \(ft,<f%)-ffi',<frf)\<K\oi>-o>'\- (5-5) 

It is easy to prove that, if 7 is an admissible curve, there exists do > such that 

(*) {s E [0, L] I dist(7(s), Qm) < ^o} is the union of a finite number of disjoint intervals [Si, S[], . . . , 
[S r ,5^.]; for all m — l,...,r each interval [S m ,S' m ] intersects 1(7) at a unique point s*„ and 
j([S m , S' m ]) C W m . Moreover (s 1— > dist( 7 (s), Qm)) is decreasing on [,§ m ,s£j, increasing on 
(s* n , S'J, and dist( 7 (s), Qm) > (i//2)|« - s* m \ for all s e [S m , S'J. 

Now we are able to define the "unperturbed transition chain" : for some small constant p > which will 
be specified later we choose t£N and k + 1 "intermediate frequencies" 

=: 57o, . . . ,SJfc-i,S7ft := 
with tUj := 7 (sj) for certain =: so < si < . . . < Sk-i < Sk '■— L verifying 

^ < Si+i - Si < pp, Vt = 0, . . . , k - 1. (5.6) 



By (5.6) there results that 

— <k<— , (5.7) 
p/i pp 

moreover it follows from (a) that 

|5Jj + i —57,- 1 < pfi, Mi = 0, . . . , k - 1. (5.8) 



This condition has been used before in lemma 3.4, Given k time instants 9\ :— 9 1 < 82 < ■ ■ ■ < Q% < 
. . . < 9k, we define the {^,-}i=i,...,fe by the iteration formula 

Vi= l P% 1 , Vi+i='<Pi + Vi(6i + i-6i). (5.9) 

The choice of the instants {#i}i=i,...,fc is specified in the next lemma: the main request is that 
must arrive (5-cl ose m o d 27r Z d+1 , to t he l ocal minimum point (9q i , ip^ 1 ) of the Poincare-Melnikov primitive 
Tpi, ; •), see From we derive that if Wj is 1/ ln/x| far from the set Qm of "non- 

ergodizing frequencies" we can reach this goal for "short" time intervals 9i+i — 9i « | In /x | . In order to 
cross the set Qm of "non-ergodizing frequencies" we need to use longer time intervals 9i+i — 9i « 1/ 
dist(Q M,^i) if yWI In |t I < dist(Q.w,57i) < 1/| In Ml- When the ZJi are "close" (less than ^p/\ln.p\- 
distant) to the set of non-ergodizing hyperplanes Qju we choose again Oi+i — 6i w |lnp|. We also 
estimate in (5.13) the total time 9k — 8\ = J2i=i — ®i- 

Lemma 5.2 V<5 > there exists pe > smc/i f/iai V0 < p < pe there exist {0i}i=i,...,fc with 9\ = 0Q 1 
satisfying, 

• (i) if dist(UJi,Q M ) > jt^i then 



maxjdllnpl * } < 9 l+1 - 9 t < 2max{c 1 |lnp|, ^ }, (5.10) 

I dist(Wj, Qm) > >■ dist(cJi, Qm) > 

where M = &na,d+i/5; 

• (it) ifdist(TJi,Q M ) < then Ci\\np\ <9 l+1 -6 t < 2Ci\\np\, 



and such that 



dist((^,^) ) (^ ! ^) + 27rZ d+1 ) <S, Vz 



= 1,- 



(5.11) 



where ip 1 , . . . ,<p k are defined by (5.9). Equivalently, Vi = 1, . . . , k, there exist hi £ Z d+1 and Xi G R d+1 
such that 

(e i ,Tp i ,) = (ef,<pf) + 2TTh i + X i with \ Xi \<s. (5.i2) 

Moreover there exists a constant K(j) such that 



9 k -9 x < K(i) 



(5.13) 



Proof. Let ^ > be so small that ^/Jlq/\ hi^l < and a/| In fi e \ > 32 V / Ci/ '(vy/Sp). 

Let us define (9i,'ip 1 ) '■— (Oq 1 ,^ 1 ). Assume that (9i,...,9i) has been defined. If dist(ZUj, Qm) > vW 
|ln/i| then by (5.3) there certainly exists (9i + i,Tp i+1 ) satisfying (5.S),( 5TC| ), such that 



dist(^ +1 ,^ +1 ), (c +1 ,^o 1+1 ) + ^z d+i ) < */4. 



We now consider the case in which ZZJ.j is close to some "non-ergodizing" hyperplanes of Qm- If dist(a7i_i, 
Qm) > V^/U 11 ^! and dist(aJj, Qm) < \fp-/\^' a -P\ we proceed as follows. We have ZJi = 7(sj), with 
Si € [S^S^] for some 1 < g < r. Moreover, by property (*) there exists p* € N such that {j € 
{1, . . . , £;} I s 3 g 5,] and dist(oJj-, Qm) < vWI ln Ml} = {*>•••! « +P* - 1}, and Si < s* < s i+p *-i. We 



shall use the abbreviations s* for s*, and cj* for u* We claim that 



1 < P* < P 



4 A /CiP/u|ln/x| 



(5.14) 



In fact, by fl5.6|) and (*) 



^fp(p* - 1) < 7rK s i+P*-i - s *) + ( s * - s i)] < dist(a; i+p »_i, Q A f) + dist(a/ i; Q M ) < 2 
4 2 I In /ij 

Hence p* < 8(^p v / /I| ln/i|) _1 , which implies . 14[ ) , by the choice of /i6- 

Now we can define the . . . ,9i +p *. The flow of (w*, 1), as any linear flow on a torus, has the 
following property : there exists T*(lu* , 5) > (abbreviated as T*) such that any time interval of length 
T* contains t satisfying dist((iw*, t), 27rZ d+1 ) < 5/4. 

Therefore (provided Ci| ln/xg| > T*) we can define . . . , such that 

Ci|ln M | <9 i+j+1 -9 i+j < 2C 1 |1ha*|, dist((0 i+i , & +i ), (0^) + 27rZ d+1 ) < 5/4, (5.15) 
where £>i+j = ^ + uj*{9i + j — 9i). For 1 < j < p* , let 



q=l 



(5.16) 



We now check that for all j = l,...,p*, {9i+j,ip i+ j), as defined in ( |5.15 ) and (5.16), satisfy estimate 
(5.11), namely 



dist T ((^ +J ,^ +J ), Vo i+i )) ■= dist[(9 t+3 ,Jp t+J ), (9T +i ,9o i+i ) + 

We have by Q5.16j ) that 



2ttZ 



rf+i 



< 5. 



(5.17) 



dist' 



i((§i + j,Tp i+j ), (9 l ,TpS) < dtetr ((Pi+j, Vi+j), (Si&i)) + j ^{^i+q-i - u*)(9 i+q - i+q -i) 



9=1 



< <V4 + 2C , 1 |ln/i|^|s i+(? _ 1 -s*| (by (fjif) and (a)) 

q=l 

< */4 + 2(7i|ln/i|}>*(ai + p._i-s i ) 

< 5/4 + 2C 1 \lnp\p 2 pp < 3(5/8, 



by and Q5JJ ). Therefore, by (jSj), 



dist 



< y + dist T ((^,^),(C^o < 
35 (5 



< 



Kpfip < 5 



by ( 5.14 ), provided /xe has b een c hosen small enough. 
There remains to prove ( 5.1 3| ) . By (*) we can write 



A m := {s E [S m ,S' m ] I p^r < dist( 7 ( S ), Q M ) < | } = PmM U [V^,U' m ], 

with S m < U m < V m < s* m < < U' m < S' m (in the case when uj* = cjj,_f, A m is just an interval). 
Moreover, by (a), s* m -V m , F m -s m > ^/Jl/\ \np\. Dehne A := \J r rn=1 A m . We have 9 k -9 1 = Co+YT m =i a m, 
where 

o"o := ^2 — <J m '■= ^ {Oi+i — di)- 

l<i<k-l,Si<£A l<i<k-l,Si£A m 

For Si g A, 6 i+1 -8i< 2d\\np\, hence er < 2C 1 k\\np\ < AdLXnpKpp). For i S A m , 9 i+1 -~9~i < 
47r(dist(aJi, Qm))^ 1 < 87r/ — s^|) by (*), and hence, using that by (|5~6|) s J+ i > Sj + pp/2, 



g„, < 



8tt 



E 



V 1 — ' \Sj — s 



1 „ 16tt 
' ~ vpp, 



E 



Si+l - s. 



i<i<fc-i,sieA„ 



Estimating the above sum with an integral we easily get 



0~m < 



16tt 



ds 



16tt 



- ^m) vpp J Um s* m - s v(y,' m - s* m ) vpp J v , s - s 



UL 



ds 



(5.13) can be easily deduced by the bound on s* m — V m , — s* 



In the next section we will prove the existence of a diffusion orbit (^j^Vi) close to the "unperturbed 
pseudo-diffusion orbit" (Tp(t),q(t)) : (9i,6k) — > R d+1 defined, for t e [0j,0 i+ i], as Tp(t) :— Tp i +ZJi(t — 0j) 

and l\[6ifi i+ i] :=< 9? <+1 -? 4 (" - ^) ( mod - 2?r )- 
6 The diffusion orbit 

We need the following property of the Melnikov function T(oj, •, •) defined w.r.t. to the variables (b, c) by 

f (w, 6, c) := T(lj, 9% +b,(p% + boj + c). 



Lemma 6.1 Assume that T(w, •,•) possesses a non- degenerate local minimum in (0q,(^q). r/ien i/iere 
exisi r > 0, 6 > 0, z/j > (j = 1,2) depending only on 7 sitcft i/iai Vw = 7(5), s S [0, L] 

• (i) <9 c f(w,o,c) ■c>f 2 >0or |$,f (cj, 6, c)| > i/ x > for |c| = r, |6| < 6, 

• (m) 9fcF(w, b, c) x sign(o) > v\ > /or |c| < r and b = ±6. 



Proof. We can assume that (5.4) is satisfied. Since r(w, •,•) possesses a non-degenerate minimum 
in (^ojVo)' b, c) possesses in (0,0) a non degenerate minimum. Hence we write T(uj,b,c), up to a 
constant, as T(lu, b, c) = Q 2 (k, c) + Qz(b, c) where Q2(b, c) =: (3 w b 2 /2 + (a w • c)b + ( 7aj c ■ c)/2 is a positive 
definite quadratic form G R, a w G R d ,7^ S Mat(d x d)) and Q 3 = 0(\b\ 3 + |c| 3 ). More precisely, by 
(5.4), there exists e > such that f3 u > e, and d w (c) := /3 w (7u;C ■ c) — (a u ■ c) 2 > e|c| 2 for all uj G t([0, L]). 
In addition, by the smoothness of T and the fact that u> = 7 (s) lives in a compact subset of R rf , there 
exists a constant M such that, Vw G j([Q,L]), \a u \ + \/3 u \ + \yj\ < M, |VQ 3 (6,c)| < M(b 2 + |c| 2 ). 

We have dbQ2(b, c) — f3 u b + a u ■ c and d c Q2(b, c) • c = ba^ ■ c + (j^c ■ c). 
Let us define V\ := inf a ,g 7 ([o,i]) e/(4|a w |) > and F 2 := mf^g^o,!,]) e/(4/? w ) > 0. Then consider 
i/i := Fir, i/2 = t 7 2?' 2 and 6 := r sup wg7 ([ ,L])(3i 7 i + la^l)//^, r G (0, 1]. We now prove that, provided 
r > has been chosen sufficiently small, conditions (i) and (ii) are satisfied with the above choice 
of the constants. Indeed if (|a w • c| + 2z7 1 r)/ / 3 a; < \b\ < b and |c| < r then dbt(uj,b, c) • sign(fe) > 
— l a w • c| — |3bQ3(b, c)| > 277ir — 0(r 2 ) > i^i for r sufficiently small. In particular this proves (ii). 
On the other hand if |6| < (\a u ■ c| + 277 1 r)//3 a) and |c| = r then 



d c T(uj, b, c) ■ c 



b(a UJ • c) + ( 7w c • c) + 9 C Q 3 (6, c) • c > ( 7(J c ■ c) - |b(a u • c)| + 0(r 3 ) 



> 



> 



er 2 + (a u ■ c) 2 - \t 



2V ir ) 



+ 0(r 3 ) 



lpM r 2 + (r 3 ) > ^-r 2 - 0(r 3 ) > 277 2 r 2 + 0(r 3 ). 



Hence (i) is satisfied for r small enough. ■ 

The partial derivatives of T are Lipschitz-continuous w.r.t. (b, c) uniformly in u> G 7([0, L\). Therefore, 
by lemma there exists 5 > such that, Vr/ G R with \t)\ < S, V£ G R d with |£| < <5, \/uj G 7([0, L]), 

<9 c f (w, 6 + ?/, c + £) • c > 3j/ 2 /4 > or \d b f(cu, b + rj, c + f)| > 3fi/4 > for |c| = r, |6| < 6, (6.1) 

<9 6 f (w, 6 + c + x sign(6) > 3^i/4 > for \c\ <r and = ±5. (6.2) 



Moreover let us fix p > such that 



p< min{^i/2,j/ 2 /r}/(6C 2 ), 



(6.3) 



where C 2 appears in (3.24). These are the positive constants (5, p) that we use in order to define, for 
< fi < pq , TDi, 6i, Tp i by lemma 5^ . 

Sinc e 7([0 , L]) is a compact subset of T> C N: inf s e [ jj\ /3( 7 ( a)) > and, by the choice of Oi, for /z small 
enough ( 3.21 ) is satisfied. Therefore, by lemma Bi> and ( 5.12| ), there exists p7 > such that, V0 < p < p7, 



k-l 



Ci\ 



2^A6 t + (b l+1 - h) 



k 

2=1 



T(ui, rji + bi, ^ + a) + R 7 , 



(6.4) 



where \r]i\ < 5, < S, R7 is given by ( |3.23j ) and satisfies ( 3.24 ). 
We minimize the functional on the closure of 

W:={(b,c):=(b 1 ,c 1 ,...,b k ,c k )eR {d+1)k | \h\ <b, \a\ < r, Vi = 1, . . . , *}. 



Since W is compact, attains its minimum in W, say at (b,c). By lemma 2.5 the existence of the 

we show 

Wi := Wi(b,c) 



diffusion orbit will be proved once we show that (b , c) G W, see lemma Let us define for i = 1, . . . , k—1 



A6i + (b i+1 - h) ' 



and wq = Wk = 0. From ( J5 ,9| ) and (3.14), Wi can be written as 



W.j 



By the expression of in ( |6.4[ ) we have, for all i = 1, . . . , k, 

d Ci T^,(b, c) = - w t + fid c T(lJi, rji + h, ii + Cj) + i?, 

d^T^b^) = ]^{\wi\ 2 - K-i| 2 ) + fjbdbT{uSi, 7]i + bi,£i + q) + 
where := d Ci Ri, Si :— d^Rr satisfy, by flS.24 ) and ( |6.3| ) 

1^1,15,1 <fmin{|,^}. 



5, 



(6.5) 

(6.6) 
(6.7) 

(6.8) 



By (6. 6)- (6. 7), a way to see critical points of is to show that the terms Wi-\ — Wi and \w%\ — |tu,-_i| are 
small w.r.t the 0(//)-contribution provided by the Melnikov function. By (3.8) — u7j| = 0(l/(9i + \ — 9/)) 
and hence, using ( |6.5|) , an estimate for each Wi separately is given by = O(\/\0i+\ — 9i\) + 0(/i/| ln/x|). 
Hence each \wi\ is 0(ju)-small if the time to make a transition — 9i\ = 0(1/ fi), as in J?]]. These time 
intervals are too large to obtain the approximation for the reduced action functional given in lemma 



3.5 and (6.4). Therefore we need more refined estimates: the proof of Theorem 1.1 (and Theorem 1.3) 



relies on the following crucial property for Wj := Wi(b,c), satisfied by the minimum point (b,c). 
Lemma 6.2 We have (for i = 1, . . . , k,) 



i) \wi-Wi-i\=0(fi), it) \wi\=o( 7== )- 



(6.9) 



Proof. Estimate ( |6.9| ) — i) is a straightforward consequence of (3.6) and fl6.8| ) if \ci\ < r, since in this 
case dc^^bjc) = 0. We now prove that ( |6 . 9[ ) z ) holds also if |cj| = r for some i. Indeed if \ci\ = r then 

d Ci J-f_i(b,c) = a^Ci for some a M < (6.10) 



(since (b,c) is a minimum point) and then by (6.6) , (6. 1C ) and ( p.q ) we deduce 

Wi-i -Wi = 0^ + O(n). (6-11) 

Let us decompose Wi-i and Wi in the "radial" and "tangent" directions to the ball S% — {\b%\ < b, \a\ < r}: 

Wi-i = a-iC-i + Ui with u% ■ Ci = (6-12) 

— Wi — a'/ci + u'i, with u\ ■ c, = 0. (6.13) 
Since |cj_i| < |cj| = r, |2j+i| < |c"j| = r, there results that 

air 2 = • Q > and a^r 2 — —Wi ■ Ci > 0, (6-14) 



so that aj,c4 > 0. Summing (6.12) and (6.13) and using (3.11) we obtain 

(a, : + a'^Ci + (ui + u'^ = 0(h) + a^Ci, 



with a,i, a[, - a,, > 0. T his implies that a M = 0(h/t) and from equation ( 6.11 ) we get (6.9)z). 

We can now prove (5.9) — ii). Let i$ £ {1, . . . , k — 1} be such that VI < i < k — 1, |u>i 1 > \wi\. For 
j G {1, . . . , k — 1}, j 7^ io we can write = Wi + Sj with sj — 53i=j — ^i) an( i hence, by (6.9)z) 



(6.15) 



for some constant C > 0. Hence 



3-1 



3-1 



i=ia i=ia 



,) + 5>( 



(6.16) 



> l^oll^ - Oi \ - cn\j - i \\9 s - e io \ = (\w la \- c»\j - iol) \e s - e l0 \. (6.17) 



and then by (6.15) 



Since \6 i+1 - 0i\ > d\ljxii\ +0(1) (by fl^)), Vt = 1, . . . , k - 1, \6j - 9 io \ > d\j - i \ ■ | ln/i|. Take 
j G {1, . . . ,k — 1} such that \j — i \ — [(a/MvI m Ml) -1 ] + 1 (such a j certainly exists since, by (5.7), 
k « l//i for ^ small). Then we obtain, using that |cj| < r for all i = 1, . . . , k, 



2r > 



(I 



> (\w in \-C- 



VJ 1 



- Cn)C x 



i.e. \wi \ < j__v^ ^_ nave ^- nus p rovec ] the important property (3.9) — ii). 

OiV|ln/z| 



Remark 6.1 By (d.aj), (uJi — Wj) = + 0(///| ln/x|), so t/ia£ 7 &?/ (\5.8\) , ( p.fl j implies 



K-Wil =o(— t==), - a;i| = OO). 



(6.18) 



A^ote i/iaf, /rom we would just obtain |Dj — = 0(l/|ln/i|). (6.18) can be seen as an a-priori 

estimate satisfied by the minimum point (6,(p). 

The following lemma proves the existence of a local minimum of the reduced action functional in the 
interior of W and hence of a true diffusion orbit. 

Lemma 6.3 Let (b,c) be a minimum point of over W. Then (b,c) G W , namely 

\ci\<r for all ie{l,...,k} (6.19) 

and 



\bi\ < b for all i G {1, . . . , k). 



3.20) 



PROOF. By (|6T9| ) we have ||u>i+i| 2 - |iui| 2 | < \u>i+i - • (|wi+i| + |to»|) = 0(p 3 ^ 2 ), and hence, from 
(6.7) we derive 

db^&c) = fidbf(j3i,TH + K£i+Ci) + 0(M 3/2 ) + Si. (6.21) 
Let us first assume by contradiction that 3i such that |ci| — r and |6j| < 6. In this case we claim that 

d c T(uJi, r\i + b t , & + cj) • c 4 < v 2 /2 and |t^,r(aJi, ^ + 6 t , & + ci)| < Ui/2 (6.22) 



contradicting ( ]6.l[ ), since \r]i\, < S. Let us prove (6.22). Since (6,c) is a minimum point 

dci^fiib.c) ■ Ci = (Wi-i - Wi) ■ Ci + ^<9 c r(uJj, rn + bi,£,i + Ci) -Ci + Hi -Ci = a^Zi ■ Zi = a^r 2 < 0. 



By (6.14) and (3.8) it follows that d c T(uji,r]i + + Ci) ■ Ci < 1*2/2. Moreover since |6j| < b we have 
9bi-7>(&>c) = 0, and by (6.21 ), ( |6.8D it follows that \dbT( ujj, rg + 6,,^ + Ci)\ < v\/2 (provided /j, is small 
enough). Estimate ( |S.2S ) is then proved. As a result, if fl6.20 ) holds, so does d6.19[) . 

Let us finally prove ( 6.2C ). If by contradiction 3i with =6, by (|6.21 ), (3.8) and since (b,c) is a 
mini mum point, arguing as before, we deduce that <9t>r(u7i, rji + bi, £i + Ci)sign(&i) < i^i/2. This contradicts 
(l6.2|) since |^| < 5. The lemma is proved. ■ 



PROOF of Theorem |l.l| . Lemmas [3j] and 2.3 imply the existence of a diffusion orbit := 
{fn(t),Qii{t),In(t),Pn(t)) with (fniOi) = oji + O(fi) and ip^dk) = + O(fi) (z M (-) connects a O(m)- 
neighborhood of 7^ to a O (/^-neighborhood of 7^ F in the time-interval (ti,T2) where ti := (0\ +02)/2, 
t~2 '■= (Ok-i + ^fc)/2). The estimate on the diffusion time is a straightforward consequence of ([[If) and 
the fact that Oik = #i.fc + O(l). That dist(7 Al (t), 7([0, L])) < ry for all t, provided [i is small enough, 
results from ( 6.1S| ) and the estimates of lemma |2.1| . 

Finally we observe that, if the perturbation is fi(f + /i/), then lemma 2.1 still applies with the same 
estimates. Moreover in the development of the reduced functional the term containing f gives, in time 
intervals Oi+i — Oi < const.\ ln/i|/^/7i, negligible contributions o(/i). Therefore the same variational proof 
applies. ■ 



Proof of Theorem |l.3| . If the pertur bation is of the form f(ip,q,t) = (1 — cosq)f(ip,t), by remark 
2.1-2), we ca n prov e that the development ( 3.22 ) holds along any path 7 of the action space (without any 
condition as (3.21)). Therefore the previous variational argument applies. ■ 

For f3 > small let T>^ N be the set of frequencies "/3-non-resonant with the perturbation" V^ N := {oj e 
R d I \u> ■ n + l\ > (3, V < \(n, l)\ < N}. If (3 becomes small with p, our estimate on the diffusion time 
required to a pproach to the boundaries of C n T)^ N slightly deteriorates. In the same hypotheses as in 



Theorem LI we have the following result. 



Theorem 6.1 Vi? > 0, V < a < 1/4, there exists /is > such that V0 < p. < pg, \/uji,ujf & 
C n n -Br(O) there exist a diffusion orbit (ip^(t), g M (i), I ^(t) , p ^(t)) of (S^) and two instants T\ < t 2 
with I^{tx) = ujj + O(n), I^(t 2 ) = oj f + O(p) and 

\r 2 -n\=0(l/p l + a ). 



(6.23) 



Proof. For simplicity we consider the case in which (3(uji) 
Theorem 



1.1 



0(p a ) and (3(w F ) = O(l). With respect to 
we only need to prove the existence of a diffusion orbit connecting loi to some fixed w* lying 
in the same connected component of T> c n C]Br(0) containing wj. In order to construct an orbit connecting 
loi to lo* we can define oJj := loi + i(uj* —LUi)/k 7 for < i < k and fc := [\lu* — u>i\/pp] + 1. We obtain that 
/3j = (iiuOj) > C(p a + jpp) for some C > and we choose 0j + i — Oj > const. (3j 2 verifying in this way the 
hypotheses of lemma [T^. If oji belongs to some Qm the transition times | ln/i|/ \fp needed to cross Qm 
(see lemma ^2) still satisfy ( 3.2l| ). We finally obtain a diffusion time Ok — 0i = YljZl(@j+i ~ Qj) = 0(1/ 
p 1+a ). M 



7 The stability result and the optimal time 

In this section we will prove, via classical perturbation theory, stability results for the action variables, 



implying, in particular, Theorem 1.2. We shall use the following notations: for I £ N, A C C and r > 0, 



we define A r := {z G C l | dist(z, A) < r} and := {z G C l | |Im zA < s, V 1 < j < 1} (thought of as 
a complex neighborhood of T l ). Given two bounded open sets B C C 2 , D C C 1 and f(I,<p,p,q), real 
analytic function with holomorphic extension on D a x T l s+a x B a for some a > 0, we define the following 
norm ||/||b_d s = ^2kpz' su P(p,?)es \fk(I,P, <?)|e' fc ' s where fk(I,P,q) denotes the /c-Fourier coefficient of 
the periodic function tp — > /(/, tp,p, q). 

Let us consider Hamiltonian TL^ defined in ( |l.l| ) and assume that /(/, f,p, q, t), defined in ( |l.2| ), is a 
real analytic function, possessing, for some r,f,r, s > 0, complex analytic extention on {/ G R d | |/| < 
r} r x Tf x {p G R I |p| < ¥} r x T s x T s . 

It is convenient to write Hamiltonian H.^ in autonomous form. For this purpose let us introduce the 
new action-angle variables (Io,<fo) with t — (po, that will still be denoted by / := (Jo, I\, . . . , I n ) and 
<p := (tp ,tpi, . . . ,tp n ). Defining h(I) := I + |/| 2 /2 and E := E(p,q) := p 2 /2+ (cosq- 1), is then 
equivalent to the autonomous Hamiltonian 



H := H(I, <p,p, q) := h(I) + E(p, q) + (if (I, <p,p, q). 



(7.1) 



Clearly, Hamiltonian H is a real analytic function, with complex analytic extention on 

j/eR d+1 |/|<r| xTf +1 x|peR |p|<r} x T s . 

In the sequel we will deno te b y z(t) := (I (t) , (pit) , p(t) , q(t)) the solution of the Hamilton equations 
associated to Hamiltonian (7.1) with initial condition z(0) = (1(0), ip(0),p(0), g(0)). 

The proof of the stability of the action variables is divided in two steps: 

• (i) (Stability far from the separatrices of the pendulum:) prove stability in the region 

£ x := Et U £ t ~ := { (I, p, q) \ E(p, q) > pf* } U { (I, p,p, 9 ) | - 2 + < £(p, g) < } 

in which we can apply the Nekhoroshev Theorem obtaining actually stability for exponentially long 
times, 

• (ii) (Stability close to the separatrices of the pendulum and to the elliptic equilibrium 
point:) prove stability in the region 

£ 2 :=£+U£ 2 - := {(I,cp,p,q) | -2^ < E(p, q) < 2^}u{(7, cp,p, q) \ -2<E(p,q) < -2 + 2^) 
in which we use some ad hoc arguments, 



where < Cd < 1 is a positive constant that will be chosen later on, see (7.12). 

We first prove (i). In the regionspj ; — Tlq p£^~ we first write trie pendulum Hamiltonian E{jp^ Q^j in 
action-angle variables. In the region] 

£t U {p > 0} the new action variable P is defined by the formula 
V2 



P := P + {E) := 



\/E+ (1 + cosip)dip. 



while in the region £ 1 the new action variable is 

P;= P -(E) = -± ' 



\/E+ (T+cos^) dip 



where ipo(E) is the first positive number such that E + (1 + cosipo(E)) = 0. We will use the follow- 
ing lemma, proved in flio| |, regarding the analyticity radii of these action-angle variables close to the 
separatrices of the pendulum. 

Lemma 7.1 There exist intervals D ± C R, symplectic transformations <f> = ± (P, Q) real analytic on 
D x T with holomorphic extension on D^ o x T So and functions E real analytic on D ± with holomorphic 
extension on D^r Q such that (jr iz (D ± x T) = £^ and 

E{cj ) ± {P,Q))^E ± {P), 

with ro = constfj,° d and sq — const/ | In fi\ . Moreover, for E bounded, the following estimates on the 
derivatives hold^ 

dE ± 1 



±- 



dP 

d 2 E ± 
dP 2 



(P ± (E)) 
(P ± (E)) 



ln- L (l 
1 

W\ 



\E 







hT 3 (l- 



\E\ 



(7.2) 
(7.3) 



7 n Pi q denotes the projection onto the (p, q) variables. 
8 The case with p < is completely analogous. 

9 If f(x),g(x) are positive function, with the symbol / ~ g we mean that 3ci,C2 > such that c\g(x) < f(x) < 
C2g(x), Vi. 



After this change of variables Hamiltonian H becomes 

H* := H± (I, <p, P,Q)~ h ± (I, P) + nf* (I, <p, P,Q):= h(I) + E ± (P) + M /± (I, <p, P, Q) 

where /± (/, <p, P Q) := /(/, <p, 0± (P, Q)). 

Stability in the region £^ . In the region £± , the proof of the stability of the actions variables 
follows by a straightforward application of the Nekhoroshev Theorem as proved in Theorem 1 of |Q . In 
order to apply such Theorem we need some definitions. For I, m > 0, a function h := h(J) is said to be 
l,m-quasi-convex on A C R d+1 , if at every point J e A at least one of the inequalities 

\(h'(j),o\>m, (h"(j)^o>m\e 

holds for each £ £ R d+1 . Using the previous lemma it is possible to prove that, for every r > 0, the 
Hamiltonian h + is l,m-quasi-convex in the set 5 := D+ x {/ e R d+1 | |/| < r} ro with l,m = 0(1). In 
the previous set also holds 

\\(h+)"\\ =: M = 0(jT^ ln- 3 (l//i)) , \\(h + )'\\ =: tt = 0(1). 

Putting e := ^\\f + \\s,s = O(p), e := 2- 10 r 2 m(m/llAf ) 2 ( d+2 > = 0(^ 2cd ( d + 3 ) ln 6(d+2) (l/ M )), a := 
(l-2cd(d+3))/2(d+2) we obtain that, if the initial data (1(0), </?(0),p(0), g(0)) € £^ , that is P(0) € £>+, 
then 

\I(t) - /(0)| < const. u a hT 3 (l//z) , for |i| < const. exp(const.u- a ln 2 (l/^)). (7.4) 

If Cd < l/2(d+ 3) then a > and we obtain stability for exponentially long times. 

Stability in the region £7 . In the region £7 we cannot use the Nekhoroshev Theorem as proved in 
fl1| , because E~ is concave and so hr is not quasi-convex. However we can still apply the Nekhoroshev 
Theorem in its original and more general form as proved in |ll| (see also pjj); in fact the function hr 
proves to be steep (see Definition I.7.C. pag. 6 of p6l). 

For simplicity we prove the steepness of the function h~ in the case d — 1 only. In this case hr = 
h~(Io, Ji,P) = Iq + if /2 + E~(P). We need more informations on the function E~ . In the following, in 
order to simplify the notation, we will forget the apex - writing, for example, E = E~ and P = P~ . 
By (1.11) of |l6[, since Vh~~ ^ 0, a sufficient condition for hr to be steep is that the system 



Vi+IV2+E'{P)n 3 
r&+E"{P)r,l 
E"'(P)4 



= 
= 

= (7.5) 



has no real solution apart from the trivial one r/i = = n 3 = 0. 

Making the change of variable tp — arccos(l — E + £E), where E = E + 2, we getF 



P(E)= [ F^E)^, P(E)=3~ 1 / 2 [ F 2 (Z;E)d£, P(E) = C F 3 (£; E) d£, (7.6) 
Jo Jo Jo 



where 



F2&E) 



V2 



^vevw - E 



10 We will denote with " ' " the derivative with respect to E, and with " ' " the derivative with respect to P. 



From the equation E(P(E)) = E, deriving with respect to E, we obtain that 

E"'(P(E)) = -(P(E))- 5 [P(E)P(E) - 3(P(E)) 2 }. 

We want to prove that 



E"'{P{E)) < 0. (7.8) 



for every E with -2 < E < 0. This is equivalent to prove that P(E)p(E) > 3{P{E)) 2 . Using (|TV7]) we 
see that FiF 3 = F% and hence, noting that F 3 (£;E) is not proportional to Fi(£;E) for every E fixed, 
we c onclude that J F\J Fa > (J F2) 2 by a straightforward application of Cauchy-Schwarz inequality and 
Q follows from Q. 



By ( |7.8j ) the unique solution of the system (7J5) is the trivial one r\\= r\2— ?l3 = 0, hence the function 
/i~ is steep. It is simple to prove that the so called steepness coefficients and steepness indices (see again 
Definition I.7.C. pag. 6 of can be taken uniformly for —2 + p Cd < E < —p Cd : that is they do not 
depend on p. 

Now we are ready to apply the Nekhoroshev Theorem in the formulation given in Theorem 4.4 of (lq . 
In order to use the notations of we need the following substitutions^]: 

(I,P)->I, {<p,Q)^<p, H~^H, h~ > Hq, pf~^H u r ^p, 

{I e R d+1 I |/| < r} x D~ -> G, {Ie K d+1 \ \I\ < r} ro x T^ 1 x D+ x T So -> F. 



Defining m :— sup F \\ | and remembering (7.3) and the definition of ro, we have 

m < const. pT Cd ln~ 3 (l/^i), p = const. n Cd . (7.9) 
In order to apply the Theorem we have only to verify the following condition 

M :— sup |i7i| < Mq (7.10) 

F 

where Mq depends only on the steepness coefficients and steepness indices (which are independent of /i) 
and on m and p (which depend on fj.) . Moreover we use the fact that the dependence of Mo on m and 
p is, "polynomial" (although it is quite cumbersome): that is there exist constant Cd,c~d > such that 
Mo(m, p) > const. m~ Cd p Cd (see §6.8 of |l7| ). So condition (7.10) becomes, using (7.9), 

p < const. p Cd ^ d+Zd) ln 3cd (l/^), 
which is verified choosing a < (cd + Cd)^ 1 ■ 

Now we can apply the Nekhoroshev Theorem as formulated in Theorem 4.4 of |ll|, obtaining that if 
(/(0),v>(0),p(0),?(0)) Gffthen 

\I(t) - I(0)| < d/2 := M b /2 = 0(p b ) V \t\ < T := ^ exp (^) Q = o(icxp (i)") (7.11) 

where a, b > are some constants depending only on the steepness properties of Hq. Finally, choosing 

c d <min{(2d + 6)- 1 , (cd + Cd)- 1 }, (7.12) 
we have proved the exponential stability in the region £ 1 . 

Stability in the region . In the following we will denote I* := . . . , Id) the projection on the last 
d coordinates. We shall prove the following lemma 

Lemma 7.2 Vk > 0, 3k ,/i 8 > such that V < p < p$, if (I(t),(p(t),p(t),q(t)) e for < t < T , 
then 

\I*(t)-I*(0)\ < I Vt<min{— ln-,T}. 



'We observe that we do not need to introduce the (p, q) variables so in our case C = +00. 



It is quite obvious that for initial conditions (1(0), ip(0),p(Q), g(0)) £ £2 , Theorem 1.2 follows from lemma 



7.2 and the exponential stability in the region £\. 

In order to prove lemma |7.2| let us define, for some fixed < <5 < 7r/4, the following two regions in 
the phase space : U :— {(I, <p,p, q)\ \q\ < 6 mod 2tt, \E(p,q)\ < 2p Cd } and V :— {(I, cp,p, q)\ \q\ > 5 mod 
2tt, \E(p,q)\ < 2p Cd }. We first note thatj^] 

z(t) eV Vii < t < t 2 , |«(*i)|,|g(*a)| - 6 mod 2tt =► t 2 -t x < a, \I(t 2 ) - I(h)\ < c 2 (t 2 -t x )p. (7.13) 

Indeed in this case Vii < t < t 2 , C3 < \q(t)\ < C4. This implies that t 2 — t\ < c\ and then, integrating 
the equation of motion / = —pd v f in (t\, t 2 ), we immediately get ( |7.13j ). We also claim that 



Vii < t < t 2 , z(t) e U and |g(*i)|, \q(t 2 )\ = 5 mod 2tt =>• t 2 - h > c 5 | ln(i\. (7.14) 

We denote with t\j (resp. t l v ) the i-th time for which the orbits enters in (resp. goes out from) U, so 
that t\j < t l v < tj} 1 < t^ 1 for < i < i Q . From ( fT.14j ) it follows that i < c e n Q /p and, from ( |7.13[ ), that 



the time Ty spent by the orbit in the region V is bounded by C7K0/11. 



In order to prove (7.14) we use the following normal form result for the pendulum Hamiltonian E(p, q) 



in a neighborhood of its hyperbolic equilibrium point (see e.g. |12| ) 

Lemma 7.3 There exist R,S > 0, an analytic function g, with g'(0) = —1 and an analytic canoni- 
cal transformation $ : B — > {\p\ < 6} x {|q| < S mod 2tt} where B := {\P\,\Q\ < R}, such that 
E(t>(P,Q))=g(PQ). 

In the coordinates (Q, P) the local stable and unstable manifolds are resp. Wf oc = {P = 0} and 
W t u oc = {Q = 0} and Hamiltonian (fTl]) writes as 

H := H(I, <p, P, Q) := h(I) + g(PQ) + <p, P, Q) 

where f(I,<p,P,Q) := f(I, <p, S(P, Q)). 



We are now able to prove ( 7.14 ). Certainly there exists an instant t* S [ti,t 2 ) for which (p(t*), q(t*)) € 
$(P) but, Vti < t < t*, (p(t),q(t)) <&(B). It follows that, if we take the representant q(ti) e [-6,6], 
then p(t$)q(tf) < 0. We will denote with Z(t) := (I(t),ip(t), P(t),Q(t)) = (I(t),(p(t),^ 1 (p(t),q(t))) 
the corresponding solution of the Hamiltonian system associated to H. From the fact that |g(i*)| = 8 
or (p(**),g(t*)) G d$(B) and that |ff(PQ)| < ^ c<i , p(**)g(**) < 0, it follows that \P(t*)\ < c$p Cd and 
\Q(tl)\>c 9 . 

In the same way there exists an instant t| with ii < t\ < t 2 < t 2 for which (P(t 2 ),Q(t 2 )) £ B but, 
Vi > £3 (P(t),Q(t)) 4- B\ in particular it results |P(t|)| > c w . We claim that t% — t* > en ln(l///). 
Indeed P(t) satisfies the Hamilton's equation P(t) = -g' (P(t)Q(t))P(t) - ^d Q f(I(t),ip(t),P(t),Q(t)) 
with initial condition |P(^)| < cs/j, Cd . Since |P(t|)| > cio, we can derive from Gronwall's lemma that 



t 2 — t* > cnln(l//i), which implies (7.14) 



By the following normal-form lemma there exists a close to the identity symplectic change of coordi- 
nates removing the non-resonant angles ip in the perturbation up to 0(/i 2 ). It can be proved by standard 
perturbation theory (see for similar lemmas section §5 of fl^] ). 

Lemma 7.4 Let [3 > 0. There exist R, p > so small that, defining A := mini^i<ij2 |</(£)|, S :— 
max| ? |< fi 2 \g"(£)\, then A > 2S7? 2 and p < min{A/47V, p 2 /8s, /3/2N, r}. Let A be a sublattice ofZ d+1 . 
Let T> C R d+1 be bounded and /3 -non-resonant mod A, i.e. VI G T>. h € Z d+1 \ A, < N it results 
|(1,/*) • h\ > j3. Suppose that 

e-= p\\]\\b,d, s <2- n f3,ps, (7.15) 
wheref^ D := V p , /3* := min{/3, A/2}. Then there exists an analytic canonical transformation 

* : Dx T d +} xB — ► Dx T d+1 x B 

_ s, l_ (7.16) 

(h]f,P,Q) •— » (i,<p,p,Q) 

12 In the following we will use to denote some positive constant independent on fx. 
13 B and D are thought as complex domains, as in the sequel B and D. 



with B := {\P\, \Q\ < R/8}, D := D p/4 , such that 

H := 77(7, Tp,~P,~Q) :=ffo$ = h(7) + g(I, Tp, PQ) + 7(7, P, Q) 

wtthg(7,7p,£) := ff (0+/*(7, /*(7,^,0 = E/ ieA , | fc |<^/fc(7,0e ih ^ ll/*b,A S /4 ^ e - drawer 
i/ie following estimates hold 

n-/i<g, |P-ni 5 -<3i<^, ii7ii 5iE ,. M <g. (r.iT) 

Let £ be the (finite) set of the maximal sublattices A = (hi, . . . , h s ) C Z rf+1 for some independent 
hi G R rf+1 with \hi\ < N for i = l,...,s < d. For A € £ we define the A-resonant frequencies 
i? A := {J* € R d | (1, 7*)-/i = 0, V h G A} and the set of the s-order resonant frequencies Z s := UdimA=s-R A . 

Setting hi = (Zj, rii) with ^ S R, ri,, € R d , we remark that if 7? A ^ then ni, . . . ,n s are independent. 
We also define the (d — s)-dimensional linear subspace (associated with the affine subspace R A ) L A := 
^i=i n t C R d and we denote by II A the orthogonal projection from R d onto L A . 

Since £ is a finite set, a :— minA££ min neZ d ,\ n \<N,n A n^o |n A rt| is strictly positive. 

We now perform a suitable version of the standard "covering lemma" in which the whole frequency 
space is covered by non-resonant zones. The fundamental blocks used to construct this covering will be 
r-neighborhoods of any R A i.e. R A := {/* S R d | dist(7*,7i A ) < r} for suitable r > depending on 
dimA. Let r<j > be such that (d + l)rd < C12K, for some C12 sufficiently small to be determined. For 
1 < s < d — 1 we can define recursively numbers r s sufficiently small such that < r s < ar s+ i/2N ', 
verifying]^ 

A j:™ a ' „ dA _l dA' ». t 

T-(s+l)r s 1 1 -"-(a+l) 

We also define, for 1 < s < d-l, S° := R d \(uf =1 Z| r J and S s := ^ ( s s+1)r . s \ (ut s+ i^ +2)ri ), i.e. the s- 
order resonances minus the higher-order ones. We claim that R d = S°Li. . .L)S d ^ 1 LlZ^ d+1 - )rd is the covering 
that we need. We also define S° C S° := R d \ (U? =1 3* 4 ) and S* 5 C := Z ( s s+1)rs \ (ut s+ iZ ( ' s+1)ri ). 

If the orbit lies near a certain R A (but far away from higher order resonances) then the following 
lemma says that the drift of the actions 7* in the direction which is parallel to R A is small. 

Lemma 7.5 Suppose that 7*(0) G S s , I*(t) G 5| and |7*(i)| < f + r/2, V0 < t < T* for some 
T* < Ko\ In fi\/fi and < s < d — 1. Then, if s > 1, t/iere exists a sublattice A C Z d+1 , dimA = s suc/i 
f/iaf 7*(t) € ^f s+ i )r , \ ( u i=a+i^( 8+ i) r< )' V0 < /; < T*. Moreover i/« is sufficiently smal^\ 

|n A (r(<) - r(o))| < ri/2 vo<t<r (7.19) 

and hence, for s > 1, |7*(i) — (0) | < 2(s + l)r s + ri/2. In particular for 7*(0) € S" me /lave i/wrf 
|J*(i) - 7* (0)| < ri/2, V0 < t < T* . 

Proof. In the case s = we take A = {0}. The existence of A is trivial because 7*(0) € S s and 
hence 7*(0) G i$ +1)r<> for some Ae£ with dimA = s. The fact that 7*(t) G i? A +1)?% \ (uf =s+1 Z ( l s+1)r .), 



dimA = dimA' = s, R A ^ R A ' => ijji+iv, n i2j,' + iV. c u i=s+i^*- ( 7 - 18 ) 



V0 < t < T*, follows from 7*(t) € SI, V0 < t < T* and (|7.18|) . Now we want to apply lemma \IA 
with P := ari/2 and V := i? A s+1 j r \ (uf =s+1 Z^ s+1 ^ r .). We have to verify that V is /3-non-resonant 
mod A. Fix \h \ < N, h — {lo,n ) A (resp. ^ for s — 0). We first estimate |Z + n ■ Iq \ for all 
7* G T> ■= R A \ ( u i=«+i^( s +i) r< )- If A ' := A © (^o) and n* := Tl A n we have two cases: n* ^ or n* = 0. 
In njj ^ we can perform the following decomposition: 7g = I^+v with 7^ G Ti! A , v G 7 A and moreover^ 
v = ±\v\n* /\n* \. Since 7 * £ (\jf =s+l Z\ s+l)r .) then 7 * £ ^ ( A ' +1)rs+1 and, hence |«| > (s + l)r s+1 . Using 
the previous estimate, the fact that II G A' and \n^\ > a, we conclude that 

|Z + n -7*| = | (Z +n ■ I*) + n ■ v\ = \n ■ v\ = \n* ■ v\ = \v\\n*\ > a(s + l)r s+1 . (7.20) 

14 Assumption J7. 1 jj ) means that, in order to go from a neighborhood of a (d — s)-order resonance to a different one, we 
have to pass through an higher order dimensional one. 

15 In the case s = II A is simply the identity on R d . 
16 We observe that dist(J^ , R A ' ) = \v\. 



Now we consider the case in which ti'q = 0. In this case it is simple to see that ho = (l',0) + h where 
h G A and V G Z \ {0}. So \l + n ■ 7 * | = \V\ > 1. Now we can prove that \l + n ■ I* \ > f3 for all 7* G V. 
In fact I* = Jg + tt with 7g G 2?o an d M < ( s + 1)?V Using ( 7.2C| ) and r s < ar s+1 /2N, we have 



\lo + n ■ J*| > |l + n • | - |n • u| > a(a + l)r s+ i - JV(s + l)r a > a(s + l)r s+1 /2 > (3, 



proving that T> is /3-non- resonant mod A. Finally we c an verify ( 7.15 ) if /is is sufficiently small. Now we 
are ready to apply lemma 7_A i n ord er to prove ( 7.1 S| ) . Using ( 7.13| ), the fact that /* contains only the 
A- resonant Fourier coefficients, ( 7.17 ) and Hamilton's equation for H we have 



|II A (/*(t) - 7*(0))| < c 2 T V fi + ci 3 ^ 2 (k | ln/^l/ju) + ci 4 i ^ < c 2 c 7 n a + ci 3 ^k | In ju| + c u c 6 n a < n/2 
if kq and /zs are sufficiently small. ■ 



PROOF of lemma \T2\ Suppose first that \I*(t)\ < r + r/2 V0 < t < n \ ln/z|//i. If 7*(0) G Zf d+1 ^ and 



I*(t) G Zf d+1 ) r4 V0 < i < K |ln^|//ithen | /*(*)- J*(0)| < 2(d+l)r d and the lemma is proved if c i2 < 1/4. 
Otherwise we can suppo se t hat 7*(0) G S s for some < s < d— 1. If I*(t) G fijl V0 < t < Ko\ ln/it|//z then 
we can apply the lemma 7J) proving the lemma for C12 small enough. Suppose that 3 < T* < kq\ In /z|//x 
such that I*(t) G V0 < t < T* but I*(T*) <£ S£. We will prove that 



I*(T*) eS°u...uS s 



(7.21) 



that means that the orbit can only enter in zones that are "less" resonant. In fact by lemma we 
see that I*(T*) <£ U^ =s+ i^L +1)r ., moreover, since I* (T*) £ S*, we have that I*(T*) £ Zf +1 . and 
hence 7*(T*) (/ Uf =s Zi +1)r .. If I*(T*) G 5° we have finished. If 7*(T*) ^ 5° then 7*(T*) G U?; 1 ^. 

If 7*(T*) G S 1 we have finished. If 7*(T*) (£ S 1 then 7*(T*) ^ Z\ rx \uf =2 Z| rj and hence 



i=l ^(i+l)ri 



7*(T*) G U?Z2^ +1)r ..- Iterating this procedure we prove ( ^g| ). 

The conclusion is that if the order of resonance changes along the orbit, it can decrease only so that the 
orbit may eventually arrive in the completely non resonant zone S° where there is stability. Considering 
the "worst" case i.e. when 7*(0) G Z d d+1 y^ an< ^ ^he orbit arrives in S°, summing all the contributions 
from lemma 7.5, we have that, if cn is sufficiently small, 



d-l d 

\I*(t) - I* (0)1 < 2(d + l)r d + ^( 2 ( s + l ) r > + r i/ 2 ) + r i/ 2 = 2 ( s + 1 ) r * + rfri / 2 - K / 2 - ( 7 - 22 ) 



In order to conclude the proof of the lemma we have only to prove that if |7*(0)| < r then |7*(t)| < r + r/2 
V0 < t < kq\ ln/x|//i. This is an immediate consequence of (7.22) and of the fact that k < r. ■ 



Stability in the region £ 2 . If, for all f > (p(t), q(t)) G £ 2 , then it follows easily that \q{t) — 

tt| - 0(p c */ 2 ). Then, defining ^(J.p) := /(7,^0,7r) and / 2 (7,^,f) := ^^^/(/^.^^(i)) - 
/i(7, <p)], it results that \difa{I, <p; t)\, \d v f2(I,<p;t)\ < const. Clearly if (I{t),<p(t),q(t),p(t)) is a solution 
of (7.1) then (I(t),ip(t)) is solution of Hamiltonian 



77x := 77i(7, t) := h(I) + ^(J, V ) + u 1+ ^f 2 (I, V] t). 

NowQ one can construct, in the standard way, an analytic symplectic map $ : (I, ip) — > (7, ip) with 
|7 — 7| = 0(n/(3), and two analytic functions h, f such that [h + pbfi] o $(7,^) = /i(7) + f(I,p) with 
11/11 = 0( M 2 ). Defining f 3 := / 3 (7,^;t) := / 2 (*(T,^);t) we also get that 1^/3(7,^4)1, |%f 3 (7,^;i)| < 
const./ (3. The solutions of Hamiltonian T/i are symplectically conjugated, via to the solutions of 

the Hamiltonian 

77 2 := H 2 (l,lp;t) :=h(l) +J(I,lp) + M 1+ ^/ 2 ) / 3 (7,^;i) 
for which we obtain, directly from Hamilton's equations, the estimates 

|7(t) - 7(0)1 < coTisi • /i Cd/4 , V \t\ < const ■ ^ 1 - Cd/4 . 

17 For brevity we prove only the case in which 1(0) is in a non-resonant zone. The resonant case can be treated as in . 



It follows that, if (J(0),<p(0),p(0),g(0)) € £ 2 ~, then 

\I(t) - I(0)\ < \I(t) - I(t)\ + \I(t) - 7(0)| + |7(0) - /(0)| < const.^/ 4 , V |t| < const.^ 1 -^^ 
(if at some instant t the solution z(t) escapes outside it is exponentially stable in time). 

Finally, from the previous steps, we can conclude that there exists fi\ > such that < /J, < /ii 



Theorem 1.2 holds 



8 Appendix 

Proof of lemma EO. We shall use the following lemma: 



Lemma 8.1 There exists Tq > suc/i that, VT > To, /or all continuous f : [— 1,T+ 1] — * R, £/iere exists 
a unique solution h of 

- h + cos Q T (t)h = f, h(0) = h(T) = 0. (8.1) 
TTie Green operator Q : C°([-1,T+ 1]) -> C 2 ([-1,T + 1]) de/med 6?/ 0(/) := safe/?es 

max |/i(t)| + |fe(t)| < C max \f(t)\ (8.2) 
te[-i,r+i] te[-x,r+x] 

for some positive constant C independent of T . 

PROOF. We first note that the homogeneous problem flS.ip (i.e. / = 0) admits only the trivial solution 
h = 0. This immediately implies the uniqueness of the solution of ( |8.l| ). The existence result follows 
by the standard theory of linear second order differential equations. We now prove that any solution h 



of (8.1) satisfies (8.2). It is enough to show that max t£ [_ ljT+1 ] \h{t)\ < C max tS [_ liT+1 ] \f(t)\. Indeed 
we obtain by (8.1) that max ie [_ 1T+1 ] + — (2C + 1) max tG[-x,T+i] |/(*)| an d, by elementary 

analysis, this implies ( ^.2| ) for an appropriate constant C. 

Arguing by contradiction, we assume that there exist sequences (T n ) — > oo, (f n ), (h n ) such that 

-h n + cosQ Tn (t)h n = f n , h n (0) = h„(T n ) = 0, \h n \ n := max \h n (t)\ = 1, |/„| n — ► 0. 

te[-x,r„+x] 

By the Ascoli-Arzela Theorem there exists h G C 2 ([— 1, oo), R) such that, up to a subsequence, h n h 
in the topology of C 2 uniform convergence in [—1, M] for all M > 0. Since Qt„ — ► 9o — 27r uniformly in 
all bounded intervals of [—1, oo), we obtain that 

-h + cosq (t)h = 0, h(0)=0, sup \h(t)\<l. (8.3) 

te[-x,oo) 



Now the solutions of the linear differential equation in fl8.3|) have the form h = K\£ + K21/), where 
{K U K 2 ) S R 2 , = gb(t) = ^ and ^>(t) = i(sinht + ^) satisfies ^ - ^ = 1. The bound on /i 
implies that K2 = and /i(0) = implies that K\ — 0. Hence h = 0. In the same way we can prove that 
h n (- — T n ) —> uniformly in every bounded subinterval of (—00, 1]. 

Now let us fix t such that for all n large enough, for all t S [£, T n — t\, cosQr„(i) > 1/2 (i does exist 
because of ( [2.4|) ), By the previous step, for n large enough, there exists a maximum point t n S (t, T„ — I) 
of h 2 n (t), i.e. /i 2 (t„) = \h n \l = 1. Then (/i' 2 )(t„) = 2h n (t n )h n (t n ) = and (h 2 )(i„) = 2h n (t n )h n {t n ) + 
2/i 2 (i) < 0. By the differential equation satisfied by h n , we can derive from the latter inequality that 
cosQT n (tn)hn(tn) < fn(tn)h n (t n ), i.e. cosQT„(t n ) < f n {t n ), which, for n large enough, contradicts the 
property oft and the fact that |/ n |„ — * 0. ■ 



Now we can deal with the existence result of lemma 2.1. Let T := (9~ — 6 + ), u> = (ip~ — ip + )/T, 
Tp(t) := u(t — 6 + ) + <p + . In the following we call Cj constants depending only on /. We are searching for 
solutions (<p,q) of (2.1) with (fi(9 ± ) — <p , q(d ± ) = T 71 ", in the following form 



tp{t) = uj{t - 6+) + ip+ + v{t - 6+) 

q{t) = Q T {t-e+) + w(t-e+). 



Hence we need to find a solution, in the time interval / := [—1, T + 1], of the following two equations 



v(t) = - t i[F (p (v,w)](t) ) 



v(0) = v(T) = 0, 



[L(w)](t) = [G(v,w)](t) := -[S(w)](t) + fx[F q (v,w)](t), w(0) = w(T) = 0, 



(8.4) 



where 



[F v (v,w;X,(j,)](t) 
[F q (v,w;X,n)](t) 
[S(w)](t) 
[L(w)](t) 



d v f(uit + <p++ v(t), Q T {i) + w(t), t + 0+), 
d q f(ujt + ^++ v(t), Q T (t) + w(t),t + 9+), 
sin(Q T (t) + w(t)) - sm(Q T (t)) - cos(Q T (t))w(t), 
—w(t) + cos QT(t)w(t). 

We want to solve ( fe.4| ) as a fixed point problem. By lemma |8.l| , the second equation of ( |S.4| ) can be 
written w = K := G(—S + fJ,F g ). Moreover the first equation (8.4) can be written 



v(t)=J(t) := [J(v,w;\,n)](t) := J(t) 
where, setting F v (s) = F v (v(s),w(s)), 



J(0)(T -t) + J(T)t 



.5) 



[J(v,w; A,/i)](t) := —fi I I F v (s) ds dx. 

JT/2 JT/2 

Let us consider the Banach space Z — V x W :— C 1 (I;H d ) x C 1 (/;R), endowed with the norm 
||z| = ||(v,w)| := max{||w||v, defined by 



\V\\v := sup 
tei 



K*)|(l + c 1 vT 2 )~ 1 P 2 + Wt) I/? 



\W\\w ■= sup 
tei 



\w(t)\ + \w(t)\ 



(8.6) 



A fixed point of the operator <f> : Z — ► Z defined Vz G Z as &(z) :— (f>(z; A, /i) := (J(z),K(z)) is a solution 
of (8^). We shall prove in the sequel that <!> is a contraction in the ballQ D := B-^zj for an appropriate 
choice of c, ci, Co, provided /i is small enough. 

We have |[S(iu)](i)| < w 2 (t), so that Vi, \[G(v, w)](t)\ < c 2 ^ 2 + c±[l. Now, choosing first c sufficiently 
large and then /i sufficiently small, we can conclude using (B.2) that, if z € D, \\K(z)\\\y < c/x/4. Now we 
study the behaviour of J. Let us first consider J. We define 

fm(t) := fni(Q T (t) + w(t)), g nl (t) := f^QAt) + w(t)), 

u n i := n- (p + + W + , (3 n i := n-u; + 1. 
For t E [— 1, T + 1], z € D, we want to estimate 

j(t) = -n f F v = —fj, [neian ' f fni(s)e in - v ^e if3 ^ s d S . 



T/2 



|(n,0|<JV 



T/2 



Integrating by parts, we obtain 



iPm / fni(s)e in - v ^e^ s ds 

JT/2 



T/2 
t 

T/2 



A^Qrisy^e^ds 
(9ni(s)w(s) + fm(s)in ■ v(s))e m - v ^e^ s ds. 



18 If X is a Banach space and r > we define B r (X) := [x € X; \\x\\ < r}. 



(8.7) 
(8.8) 
(8.9) 



By (I 



the term (8.8) is bounded by C5 max{e K2t ,e K ^ T ')}. Hence, for z G D, 



C6 



F lp =u{t)-u{T/2)+R(t), with \R(t)\<- 

T/2 P 



max J e -^* >e -^(T-*) 



}+c(m+|)t], (8.10) 



where u(i) = EW^O^"'/^)^^"'*- 

So we can write J(t) = j(t) + fi(t - T/2)u(T/2), where 



-fiu(s) ds - 



T/2 



-fiR(s) ds. 



T/2 



By the bound of R(t) given in ( |3.10 ), the second integral can be bounded by c 7 (m/#)[1 + cT 2 fi//3}. 
Integrating once again by parts as above, we find that the first integral is bounded by cs(^//3 2 )[l + 
c(/iT//3)], hence, by the condition imposed on [iT, it can be bounded by /j,c/8/3 2 , provided that Co has 
been chosen small enough and c is large enough. Hence 



fic 



— + c 7 /iT + - 
c 8 



In addition 



^<<)|=rt^) + *(0l<c 10 f(i + f). 



As a result \\j\\v < Mc/4, provided c and ci have been chosen large enough, Co small enough. 

Now J(t) = j(t) + at + b, where a, b € R, so that we may replace J with j in flS.5p . Since |J(i)| < 
|j(t)|+max{Jj(0)|,|j(7 1 )|}(T + 2)/T and \j(t)\ < \dj(t)/dt\ + (1/T) \dj(s)/dt\ ds, we obtain ||J|| V < 
3||j||y < /i3c/4. We have finally proved that $ maps D into itself (in fact into B^u). 

Now we must prove that $ is a contraction. <I> is differentiable and for z = (v, w) £ D, (D$>(z)\h, g]){t) 
= (r(t),s(t)), r and s : [-1, T+ 1] -> R being defined by 

f(i) - ai {t).h{t) + h(t)g{t), r(0) = r(T) = 0, L(«)(t) = a 2 {t).h(t) + b 2 {t)g{t), s(0) = s(T) = 0, (8.11) 
where 

01 (t) = -pid vv f(u)t + v(t),Q T (t) + to(*), * + 0+), 

h(t) = -fid^fiut + <p+ + v(t),Q T (t) + w(t), t + 0+), a 2 (t) = -hit), 

62 (t) = cos(Q T (t) + w(t)) - cosQ T (t) + fid qq f(ut + <p+ + v(t), Q T (t) + w(t), t + 0+). 

By the same arguments as above (A, Z?) G Vi x V (where V% := C 1 (I, R^ 2 )) defined by 
I(i) = ai(f), A(0)=A(T) = 0, B(t) = &i(f), 5(0) = B(T) = 



satisfy ||A||vi + | j v" ^ c n c M (II ||vi being defined in the same way as || ||y). 
Using an integration by parts, we can derive from ( |B.ll| ) and the bound on 



\B\\v that 



|r(t)|<ci 2 cg[( 



1 + C!/ZT 2 



/lk)+r( 



+ llsllw 



7? LV /?2 

Therefore, for Co small enough, |/3r(t)| < l/8max{||/i||y, ||<7||iy}- We derive also from ( 8.12 ) that 

■f/T cxfj, 2 T 3 fiT 2 - 



.12) 



\r(t)\ < c 13 c 



P 3 



3 



max{||/i||v,|bl|w r }i 



which yields P 2 {l + c l ^T 2 y 1 \r{t)\ ^ c u c{nT/(3 + (1/ Cl )) max{||/i|| y , \\g\\ w } < ma^{\\h\\ v , \\g\\w}/8, 
provided Co is small enough and Ci/c is large enough. Finally ||r||v < m a x {|Nlv, llflllw}/^ 
Using the properties of L and the fact that 



\a 2 (t).h(t) + b 2 {t)g{t)\ < ci5/i(l + ci^T 2 )//3 2 ||/i||y + a 5 (\w(t)\ + fi)\\g\\ w 



we easily derive < max{||/i||y, ||s l ||w r }/4 (again provided that Co, more precisely CqC\ is small 

enough). We have proved that for a good choice of c, ci, Co, ||D$(z)[/i, g}\\ < \\(h,g)\\/2 for z G D. Hence 
$ is a contraction. As a result, it has a unique fixed point z\ in D (which in fact belongs to .63^/4). 
This proves existence. 

Now there remains to prove that ¥V,a(£), <2Vi,a(0 are C 1 functions of (A, t). Let (0q , 9q ) be fixed with 
T := 6q - 9+ and let A = {A | \9+ - 9+\ < 1/4, \0~ - 6q\ < 1/4}. For A G A I := [-1/2,T + 1/2] C 
[— 1, 9~ — 9 + + 1], hence the restrictions and of v\ and w\ to Iq are well defined. 

Let Vq x Wo := C 1 (/o,R") x C 1 (/ ,R) be endowed with the norm || || as defined in dsTq) . Define 
* : A — > Vb x Wo by *(A) = z° x . We shall justify briefly that * is differentiable and that \\D^>\\ < 



ciefj,. z® is the unique solution in B^^ of (8T) (with T = 9~ — 9 + ), which is equivalent to (v\,w\) = 
$>(z\;6 + ,9~ ,Lp + ,ip~ , fi), where $ : x A x (0,^2) — > Vo X Wo is smooth. Now, by the previous 
step, ||D Z $|| < 1/2 everywhere, so that / — D z § is invertible. Therefore, by the Implicit Function 
Theorem, is C 1 . This proves that (A, t) 1— > ^>^\{t) (resp. (A,t) 1— > q^\{t)) and (A,t) 1— > ^p^,\{t) (resp. 
(A, t) 1— > q^xif)) have continuous partial derivatives w.r.t. A in the set {(A, t)\ — 1/2 + 0+ < t < l/2 + 9~}, 
and by the standard theory of differential equations, these partial derivatives have continuous extensions 



on {(A, £)| — l + 9 + <t<l + 9 }. Finally, by fl2.l|), (p^x an d %,x depend continuously on (X,t) 



Proof of Theorem L2. In order to prove Theorem |4.2j we need a preliminary lemma. Observe that 
A R is a finite set which is symmetric with respect to the origin. Hence, if it is not empty there exists 
p G A* R such that p ■ fl = a (A, Q, R). 

Lemma 8.2 Assume that A* R ^ and let p G A* R be such that p- CI = a := a(A,f£, R). Assume moreover 
that a > and define E :— \p] ■ Then Ao := A n E is a lattice of E. In addition : 

a 2 

^ p\p~\-R ' where f 3 = ' mi {\i- n \'i e ( A °)v3fl /2 }> ( A °)* = {q ^ E \ \/x e A Q q ■ x e z} . 

In particular a < 2/3. 

(m) a{A,n,V7R/2) < (3. 

Proof. Since A is a lattice, it is not contained in E. Hence p ■ A is a non trivial subgroup of Z, 
p ■ A = mZ for some integer m > 1, which implies that p/m € A*. But p/m ■ ft = a/m and |p/m| < i?, 
hence by the definition and the positivity of a, m — 1. As a result there exists i£A such that p ■ x = 1. 
Obviously Ao + Tlx C A. On the other hand all a; € A can be written as x = [x ■ p)x + y, where y G A, 
y ■ p = 0, i.e. y G Ao. So the reverse inclusion holds and we may write A = Ao + Tlx. As a consequence 
Ao is a lattice of E and 

A* = {r G R z I r • A C Z and r • x G Z} = {q + ap : q G A„ , a G Z - g • x}, 

A /j = {9 + a P : ?eA;, aeZ-g-i, 0< \q\ 2 + a 2 |p| 2 < i? 2 }. 
If (3 — +00 there is nothing more to prove. If j3 < +00, let q G (Ao)^^ be such that q ■ CI = j3. Let 

5 = {aGR : q + ap e A* B ] = {a e K : aeZ-q-x, \a\ < {R 2 - \q\ 2 ) 1/2 /\p\}. 

Since \q\ 2 < 3i? 2 /4, 5 D S' := (Z - q ■ x) n [— J2/2|j>|, -R/2|p|]. Hence by the definition of a, for all a G 5', 
|(<7 + ap) • Q| = |/3 + aa\ > a, i.e. /3/a ^ (—1 — a, 1 — a). 

As \p\ < R, the interval [— R/2\p\, R/2\p\] has length > 1 and must intersect (Z — q ■ x). Therefore 
S' ^ 0, more precisely S' — {u, u + 1, . . . , u + AT}, for some integer If > 0, where u = inf 5'. As a result, 

A" 



/3/a </ (J (-1 - tt - k, 1 - « - k) = (-1 - u - K, 1 



fe=0 



Now S' ("I [-1/2, 1/2] 7^ 0, hence w + A > -1/2 and -1-u-K < 0. As a consequence /3/a > 1-u. Since 
[-i?/2|p|,-i?/2|p| + l] C [-i?/2|p|, J R/2|p|] intersects Z-q-x,u< -R/2\p\ + l. Therefore /3/a > R/2\p\, 
which is (i). In particular, since \p\ < R, a < 2(3. 



Finally there exists a G [-1, 0) H (Z- q-x); q + ap G A*, and |g + ap| 2 = |o| 2 + a 2 \p\ 2 < 3R 2 /1 + R 2 = 
7i? 2 /4. Hence q + ap G Kfj R /2' We have I*- 9 + ap ^ ' ^1 = I/ 3 + aa l - & because -1 < a < and a < 2/3. 
This proves (ii). ■ 



Now we turn to the proof of Theorem |L2] We first prove that the statement is true for 1 = 1, with 
a\ = 1/2. Here A = AoZ for some Ao > 0, and A* = (\q)~ 1 Z. We can assume without loss of generality 
that n > 0. If A < 26, then for all x e R, d(x, A) < 8. Hence T(A, 17, (5) = 

If Ao > 28, then it is easy to see that T(A, fl, 8) — (Ao — 26) /ft < Ao/fi. On the other hand, 
1/Ao G AJ /25 and a(A,n, 1/(25)) = f2/A . The result follows. 

Now we assume that the statement holds true up to dimension I — 1 (I > 2). We shall prove it in 
dimension I. 

Fix R > and define 6 R = (4a 2 _ 1 /3 + 4) 1 / 2 /i?. We claim that: 
(a) If A* R = tfien TfA.n.fJfl) = 0. 



(b) If A* R ^ 0, Ze< p G A^ be such that p ■ Q = a := a(A, f2, i?), and define (3 as in lemma 8.i . Then 

T(A,n,6 R ) < max{ aT 1 ,^ 1 }. 



Postponing the proof of (a) and (b), we show how to define a/. In the case (b), by lemma 8.2 (ii), 



T(A, fi, 5r) < a(A, ft, \flRj2) 1 . This estimate obviously holds in the case (a) too. Hence for all R > 0, 

T(A, Q, (4a 2 _ 1 /3 + 4) 1/2 /i?) < a(A, fi, V7R/2)' 1 . 



As a consequence, the statement of Theorem fh| holds with a; = (V7(4a 2 _ 1 /3 + 4) 1 / 2 /2). 

There remains to prove (a) and (6). First assume that A^ = 0. Let p G A* \ {0} be such that for all 
p' G A* \ {0}, \p\ < \p'\. Then \p\ > R. Let E, A be defined from p as in lemma 

Arguing by contradiction, we assume that (Ao)y3 H ^ 2 ^ 0- By the same arguments as previously 
there exist q G (Ao)^^ and a G [-1/2, 1/2] such that q + ap e A*. But |q + ap| 2 = |g| 2 + a 2 \p\ 2 < 

'V3R/2 



(3/4)i? 2 + \p\ 2 /4 < |p| 2 and this contradicts the definition of p. Hence (Aq)*^ — and by the iterative 



hypothesis, all point of E lies at a distance from A less than 2a;_i/v / 3~R- 



From the proof of lemma |8.2] , there exists x G A such that p - x = 1 and A = Ao + Zx. Therefore for 
all x G R ( , there is x 1 G x + A such that |x' • p| < 1/2. This implies that d{x' , E) < l/(2|p|) < l/(2i?) 
and hence that d(x' , Ao) < (4a 2 _ 1 /3 + 1/4) 1 / 2 //? < 6 R . Hence the distance from any point of R' to A is 
not greater than Sr. This completes the proof of (a). 

Next assume that A* R ^ and let p be as in lemma |8.2| . Define a and (3 in the same way as in lemma 
8.2. Let i £ R 1 . Again A = Ao + Zx for some x G A such that p ■ x = 1, hence there exists x' G x + A 



such that p ■ x' G [0, 1). We have 



/ W tt a 

x = y + -^p ,n = u+ —p, 



with y,U G E = [p^ , w — p ■ x' G [0, 1). We shall assume that a > (if a = 0, there is nothing to prove). 
Let t — w/a, and consider the time interval defined by 

J =[0,1/(3] ift<l/P, J=\t-l/j3,t\ if t> 1/(3. 

J C [0, max{l//3, 1/a}], and it is enough to prove that there exists t G J such that d(x' , tQ + A ) < <Sr. 
The length of J is not less than 1/(3. Hence by the iterative hypothesis, there exists t G J such that 
d(y, tU + Ao) < 2ai-i/(VSR) (notice that for all q G Aq, q • U = q ■ Q, so that the linear flow (tU) creates 
a 2a/_i/(\/llR)-net of E/A in time (3^). We have 

« + A^. ( ^) » + * W + ^< (^) 2 + ^L. 

Hence, by lemma O («) , d(x' , tQ, + A ) < (4a 2 _ 1 /3 + 4) : / 2 /i?. This completes the proof of (b). ■ 
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